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NOVEL HUMAN TRANSPORTER PROTEINS AND 
POLYNUCLEOTIDES ENCODING THE SAME 

The present application claims the benefit of U.S. 
5 Provisional Application Number 60/185,956 which was filed on 
February 29, 2000 and is herein incorporated by reference in 
its entirety. 

1. INTRODUCTION 
The present invention relates to the discovery, 
10 identification, and characterization of novel human 
polynucleotides encoding proteins that share sequence 
similarity with mammalian transporter proteins. The invention 
encompasses the described polynucleotides, host cell expression 
systems, the encoded proteins, fusion proteins, polypeptides 

15 and peptides, antibodies to the encoded proteins and peptides, 
and genetically engineered animals that either lack or over 
express the disclosed polynucleotides, antagonists and agonists 
of the proteins, and other compounds that modulate the 
expression or activity of the proteins encoded by the disclosed 

20 polynucleotides that can be used for diagnosis, drug screening, 
clinical trial monitoring, and treatment of diseases and 
disorders . 



2. BACKGROUND OF THE INVENTION 
Transporter proteins are integral membrane proteins that 

25 mediate or facilitate the passage of materials across the lipid 
bilayer. Given that the transport of materials across the 
membrane can play an important physiological role, transporter 
proteins are good drug targets. Additionally/ one of the 
mechanisms of drug resistance involves diseased cells using 

30 cellular transporter systems to export chemotherapeutic agents 
from the cell- Such mechanisms are particularly relevant to 
cells manifesting resistance to a multiplicity of drugs. 
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3. SUMMARY OF THE INVENTION 
The present invention relates to the discovery, 
- identification, and characterization of nucleotides that encode 
novel human proteins, and the corresponding amino acid 
5 sequences of these proteins. The novel human proteins (NHPs) 
described for the first time herein share structural similarity 
with mammalian ion transporters, calcium transporters 
(particularly calcixam transporting ATPases), sulfate 
transporters, and zinc transporters. 

10 The novel human nucleic acid sequences described herein, 

encode alternative proteins/open reading frames (ORFs) of 1^177 
and 374 amino acids in length (calcium-transporting ATPase, SEQ 
ID NOS: 2 and 4), 970 (sulfate transporter, SEQ ID N0:7), and 
507 (zinc transporter, SEQ ID NO: 10) amino acids in length. 

15 The invention also encompasses agonists and antagonists of 

the described NHPs, including small molecules, large molecules, 
mutant NHPs, or portions thereof, that compete with native NHP, 
peptides, and antibodies, as well as nucleotide sequences that 
can be used to inhibit the expression of the described NHPs 

20 (e.g., antisense and ribozyme molecules, and gene or regulatory 
sequence replacement constructs) or to enhance the expression 
of the described NHP polynucleotides (e.g., expression 
constructs that place the described polynucleotide under the 
control of a strong promoter system), and transgenic animals 

25 that express a NHP transgene, or "knock-outs" (which can be 
conditional) that do not express a functional NHP. ECnock-out 
mice can be produced in several ways, one of which involves the 
use of mouse embryonic stem cells ("ES cells") lines that 
contain gene trap mutations in a murine homolog of at least one 

30 of the described NHPs. When the unique NHP sequences 
described in SEQ ID NOS: 1-11 are ''knocked-out" they 
provide a method of identifying phenotypic expression of 
the particular gene as well as a method of assigning 
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function to previously unknown genes. Additionally, the 
unique HHP sequences described in SEQ ID N0S:1-11 are 
useful for the identification of coding sequence and the 
mapping a unique gene to a particular chromosome. 

Further, the present invention also relates to processes 
for identifying compounds that modulate, i.e., act as agonists 
or antagonists, of HHP expression and/or NHP activity that 
utilize purified preparations of the described HHPs and/or NHP 
product, or cells expressing the same. Such compounds can be 
used as therapeutic agents for the treatment of any of a wide 
variety of symptoms associated with biological disorders or 
imbalances . 



4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
L5 The Sequence Listing provides the sequences of the 

described NHP ORFs that encode the described NHP amino acid 
sequences. SEQ ID NOS 5, 8, and 11 describe nucleotides 
encoding NHP ORFs along with regions of flanking sequence. 

20 5. DETAILED DESCRIPTION OF THE INVENTION 

The NHPs described for the first time herein are novel 
proteins that may be expressed in, inter alia, human cell 
lines, fetal brain, pituitary, cerebellum, thymus, spleen, 
lymph node, bone marrow, trachea, kidney, fetal liver, liver, 

25 prostate, testis, thyroid, adrenal gland, salivary gland, 

stomach, small intestine, colon, adipose, rectum, pericardium, 
bone marrow, placenta, and gene trapped human cells. More 
particularly, the NHP that is similar to sulfate transporters 
(and the down-regulated in adenoma, or DRA, gene) is 

30 predominantly found in bone marrow and testis, and the zinc 
transporter-like NHP can be found expressed in the placenta. 

The present invention encompasses the nucleotides 
presented in the Sequence Listing, host cells expressing such 
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nucleotides, the expression products of such nucleotides, and: 
(a) nucleotides that encode mammalian homologs of the described 
polynucleotides, including the specifically described NHPs, and 
the NHP products; (b) nucleotides that encode one or more 
5 portions of the NHPs that correspond to functional domains, and 
the polypeptide products specified by such nucleotide 
sequences, including but not limited to the novel regions of 
any active domain (s); (c) isolated nucleotides that encode 
mutant versions, engineered or naturally occurring, of the 
10 described NHPs in which all or a part of at least one domain is 
deleted or altered, and the polypeptide products specified by 
such nucleotide sequences, including but not limited to soluble 
proteins, and peptides in which all or a portion of the signal 
(or hydrophobic transmembrane) sequence is deleted; (d) 
15 nucleotides that encode chimeric, fusion proteins containing all 
or a portion of a coding region of an NHP, or one of its 
domains (e.g., a receptor or ligand binding domain, accessory 
protein/self -association domain, etc.) fused to another peptide 
or polypeptide; or (e) therapeutic or diagnostic derivatives of 
20 the described polynucleotides such as oligonucleotides, 

antisense polynucleotides, ribozymes, dsRNA, or gene therapy 
constructs comprising a sequence first disclosed in the 
Sequence Listing. As discussed above, the present invention 
includes: (a) the human DNA sequences presented in the Sequence 
25 Listing (and vectors comprising the same) and additionally 

contemplates any nucleotide sequence encoding a contiguous NHP 
open reading frame (ORF) that hybridizes to a complement of a 
DNA sequence presented in the Sequence Listing under highly 
stringent conditions, e.g., hybridization to filter-bound DNA 
30 in 0.5 M NaHP04, 7% sodium dodecyl sulfate (SDS), 1 inM EDTA at 
65'C, and washing in O.lxSSC/0.1% SDS at 68'C (Ausubel F.M. et 
al., eds., 1989, Current Protocols in Molecular Biology, Vol. 
I, Green Publishing Associates, Inc., and John Wiley & sons. 
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Inc., New York, at p. 2.10.3) and encodes a functionally 
equivalent gene product. Additionally contemplated are any 
nucleotide sequences that hybridize to the complement of a DNA 
sequence that encodes and expresses an amino acid sequence 
5 presented in the Sequence Listing under moderately stringent 
conditions, e.g., washing in 0.2xSSC/0.1% SDS at 42-C (Ausubel 
et al., 1989, supra), yet still encodes a functionally 
equivalent NHP product. Functional equivalents of a HHP 
include naturally occurring NHPs present in other species and 
10 mutant NHPs whether naturally occurring or engineered (by site 
directed mutagenesis, gene shuffling, directed evolution as 
described in, for example, U.S. Patent No. 5,837,458). The 
invention. also includes degenerate nucleic acid variants of the 
disclosed NHP polynucleotide sequences. 
15 Additionally contemplated are polynucleotides encoding NHP 

ORFs, or their functional equivalents, encoded by 
polynucleotide sequences that are about 99, 95, 90, or about 
85 percent similar or identical to corresponding regions of the 
nucleotide sequences of the Sequence Listing (as measured by 
20 BLAST sequence comparison analysis using, for example, the GCG 
sequence analysis package using standard default settings) . 

The invention also includes nucleic acid molecules, 
preferably DNA molecules, that hybridize to, and are therefore 
the complements of, the described NHP nucleotide sequences. 
25 Such hybridization conditions may be highly stringent or less 
highly stringent, as described above. In instances where the 
nucleic acid molecules are deoxyoligonucleotides ("DNA 
oligos") , such molecules are generally about 16 to about 100 
bases long, or about 20 to about 80, or about 34 to about 45 
30 bases long, or any variation or combination of sizes 

represented therein that incorporate a contiguous region of 
sequence first disclosed in the Sequence Listing. Such 
oligonucleotides can be used in conjunction with the polymerase 
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Chain reaction (PGR) to screen libraries, isolate clones, and 
prepare cloning and sequencing templates, etc. 

Alternatively, such NHP oligonucleotides can be used as 
hybridization probes for screening libraries, and assessing 
5 gene expression patterns (particularly using a micro array or 
high-throughput ^^chip" format) . Additionally, a series of the 
described NHP oligonucleotide sequences, or the complements 
thereof, can be used to represent all or a portion of the 
described NHP sequences. An oligonucleotide or polynucleotide 
10 sequence first disclosed in at least a portion of one or more 
of the sequ^ces of SEQ ID NOS: 1-11 can be used as a 
hybridization probe in conjunction with a solid support 
matrix/substrate (resins, beads, membranes, plastics, polymers, 
metal or metallized substrates, crystalline or polycrystalline 
15 substrates, etc-). Of particular note are spatially 

addressable arrays (i.e., gene chips, microtiter plates, etc.) 
of oligonucleotides and polynucleotides, or corresponding 
oligopeptides and polypeptides, wherein at least one of the 
biopolymers present on the spatially addressable array 
20 comprises an oligonucleotide or polynucleotide sequence first 
disclosed in at least one of the sequences of SEQ ID NOS: 1-11, 
or an amino acid sequence encoded thereby. Methods for 
attaching biopolymers to, or synthesizing biopolymers on, solid 
support matrices, and conducting binding studies thereon are 
25 disclosed in, inter alia, U.S. Patent Nos. 5,700,637, 
5,556,752, 5,744,305, 4,631,211, 5,445,934, 5,252,743, 
4,713,326, 5,424,186, and 4,689,405 the disclosures of which 
are herein incorporated by reference in their entirety. 

Addressable arrays comprising sequences first disclosed in 
30 SEQ ID NOS: 1-11 can be used to identify and characterize the 
temporal and tissue specific expression of a gene. These 
addressable arrays incorporate oligonucleotide sequences of 
sufficient length to confer the required specificity, yet be 
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within the limitations of the production technology. The 
length of these probes is within a range of between about 8 to 
about 2000 nucleotides. Preferably the probes consist of 60 
nucleotides and more preferably 25 nucleotides from the 
5 sequences first disclosed in SEQ ID N0S:1-11. 

For example, a series of the described oligonucleotide 
sequences, or the complements thereof, can be used in chip 
format to represent all or a portion of the described 
sequences. The oligonucleotides, typically between about 16 to 
10 about 40 (or any whole number within the stated range) 

nucleotides in length can partially overlap each other and/or 
the sequence may be represented using oligonucleotides that do 
not overlap. Accordingly, the described polynucleotide 
sequences shall typically comprise at least about two or three 
15 distinct oligonucleotide sequences of at least about 8 

nucleotides in length that are each first disclosed in the 
described Sequence Listing. Such oligonucleotide sequences can 
begin at any nucleotide present within a sequence in the 
Sequence Listing and proceed in either a sense (5'-to-3*) 
20 orientation vis-a-vis the described sequence or in an antisense 
orientation. 

Microarray-based analysis allows the discovery of broad 
patterns of genetic activity, providing new understanding of 
gene functions and generating novel and unexpected insight into 

25 transcriptional processes and biological mechanisms. The use of 
addressable arrays comprising sequences first disclosed in SEQ 
ID N0S:1-11 provides detailed information about transcriptional 
changes involved in a specific pathway, potentially leading to 
the identification of novel components or gene functions that 

30 manifest themselves as novel phenotypes. 

Probes consisting of sequences first disclosed in SEQ ID 
N0S:1-11 can also be used in the identification, selection and 
validation of novel molecular targets for drug discovery. The 
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use of these unique sequences permits the direct confirmation 
of drug targets and recognition of drug dependent changes in 
gene expression that are modulated through pathways distinct 
from the drugs intended target. These unique sequences 
5 therefore also have utility in defining and monitoring both 
drug action and toxicity. 

As an example of utility, the sequences first disclosed in 
SEQ ID N0S:1-11 can be utilized in microarrays or other assay 
formats, to screen collections of genetic material from 
10 patients who have a particular medical condition. These 
investigations can also be carried out using the sequences 
first disclosed in SEQ ID N0S:1-11 in silico and by comparing 
previously collected genetic databases and the disclosed 
sequences using computer software known to those in the art. 
15 Thus the sequences first disclosed in SEQ ID N0S:1-11 can 

be used to identify mutations associated with a particular 
disease and also as a diagnostic or prognostic assay. 

Although the presently described sequences have been 
specifically described using nucleotide sequence, it should be 
20 appreciated that each of the sequences can uniquely be 

described using any of a wide variety of additional structural 
attributes, or combinations thereof. For example, a given 
sequence can be described by the net composition of the 
nucleotides present within a given region of the sequence in 
25 conjunction with the presence of one or more specific 

oligonucleotide sequence (s). first disclosed in the SEQ ID NOS: 
1-11. Alternatively, a restriction map specifying the relative 
positions of restriction endonuclease digestion sites, or 
various palindromic or other specific oligonucleotide sequences 
30 can be used to structurally describe a given sequence. Such 
restriction maps, which are typically generated by widely 
available computer programs (e.g., the University of Wisconsin 
GCG sequence analysis package, SEQUENCHER 3.0, Gene Codes 
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Corp., Ann Arbor, MI, etc.), can optionally be used in 
conjunction with one or more discrete nucleotide sequence (s) 
present in the sequence that can be described by the relative 
position of the sequence relatve to one or more additional 
5 sequence (s) or one or more restriction sites present in the 
disclosed sequence. 

For oligonucleotide probes, highly stringent conditions 
may refer, e.g., to washing in 6xSSC/0.05% sodium pyrophosphate 
at 3T'C (for 14-base oligos) , 48-0 (for 17-base oligos) , 55°C 
10 (for 20-base oligos), and GCC (for 23-base oligos). These 

nucleic acid molecules may encode or act as NHP gene antisense 
molecules, useful, for example, in NHP gene regulation (for 
and/or as antisense primers in amplification reactions of NHP 
gene nucleic acid sequences). With respect to NHP gene 
15 regulation, such techniques can be used to regulate biological 
functions. Further, such sequences may be used as part of 
ribozyme and/or triple helix sequences that are also useful for 
NHP gene regulation. 

Inhibitory antisense or double stranded oligonucleotides 
20 can additionally comprise at least one modified base moiety 
which is selected from the group including but not limited to 
5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xantine, 4-acetylcytosine, 

5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
25 2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, 
beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2, 2 -dimethyl guanine, 

2- methyladenine, 2-methylguanine, 3-methylcytosine, 
5-methylcytosine, N6-adenine, 7-methylguanine, 

30 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine , 5'-methoxycarboxymethyluracil, 
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5- 
oxyacetic acid (v) , wybutoxosine, pseudouracil, queosine. 
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2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 
4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid 
methylester, uracil-5-oxyacetic acid (v) , 5-methyl- 
2-thiouracil, 3- (3-aiaino-3-N-2-carboxypropyl) uracil, (acp3)w, 
and 2, 6-diaminopurine. 

The antisense oligonucleotide can also comprise at least 
one modified sugar moiety selected from the group including but 
not limited to arabinose, 2-fluoroarabinose, xylulose, and 
hexose. 

In yet another embodiment, the antisense oligonucleotide 
will comprise at least one modified phosphate backbone selected 
from the group consisting of a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a phosphoramidate, 
a phosphordiamidate, a methylphosphonate , an alkyl 
15 phosphotriester, and a formacetal or analog thereof. 

In yet another embodiment, the antisense oligonucleotide 
is an a-anomeric oligonucleotide. An a-anomeric 
oligonucleotide forms specific double-stranded hybrids with 
complementary RNA in which, contrary to the usual p-units, the 
20 strands run parallel to each other (Gautier et al., 1987, Nucl. 
Acids Res. 15:6625-6641). The oligonucleotide is a 2'-0- 
methylribonucleotide (Inoue et al., 1987, Nucl. Acids Res. 
15:6131-6148), or a chimeric RNA-DNA analogue (Inoue et al., 
1987, FEBS Lett. 215:327-330). Alternatively, double stranded 
25 RNA can be used to disrupt the expression and function of a 
targeted NHP. 

Oligonucleotides of the invention can be synthesized by 
standard methods known in the art, e.g. by use of an automated 
DNA synthesizer (such as are commercially available from 
30 Biosearch, Applied Biosystems, etc.). As examples, 

phosphorothioate oligonucleotides can be synthesized by the 
method of Stein et al. (1988, Nucl. Acids Res. 1^:3209), and 
methylphosphonate oligonucleotides can be prepared by use of 
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controlled pore glass polymer supports (Sarin et al., 1988, 
Proc. Natl. Acad. Sci. U.S.A. 65:7448-7451), etc. 

Low stringency conditions are well known to those of skill 
in the art, and will vary predictably depending on the specific 

5 organisms from which the library and the labeled sequences are 
derived. For guidance regarding such conditions see, for 
example, Sambrook et al., 1989, Molecular Cloning, A Laboratory 
Manual (and periodic updates thereof) , Cold Springs Harbor 
Press, N.Y.; and Ausubel et al.. 1989, Current Protocols in 

10 Molecular Biology, Green Publishing Associates and Wiley 

InterscienceT N.Y. 

Alternatively, suitably labeled NHP nucleotide probes can 
be used to screen a human genomic library using appropriately 
stringeiit conditions or by PGR. The identification and 
15 characterization of human genomic clones is helpful for 
identifying polymorphisms (including, but not limited to, 
nucleotide repeats, microsatellite alleles, single nucleotide 
polymorphisms, or coding single nucleotide polymorphisms), 
determining the genomic structure of a given locus/allele, and 
20 designing diagnostic tests. For example, sequences derived 
from regions adjacent to the intron/exon boundaries of the 
human gene can be used to design primers for use in 
amplification assays to detect mutations within the exons, 
introns, splice sites (e.g., splice acceptor and/or donor 
25 sites) , etc. , that can be used in diagnostics and 
pharmacogenomics . 

Further, a NHP gene hpmolog can be isolated from nucleic 
acid from an organism of interest by performing PGR using two 
degenerate or "wobble" oligonucleotide primer pools designed on 
30 the basis of amino acid sequences within the NHP products 

disclosed herein. The template for the reaction may be total 
RNA, mRNA, and/or cDNA obtained by reverse transcription of 
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mRNA prepared from human or non-human cell lines or tissue 
known or suspected to express an allele of a NHP gene. 

The PGR product can be subcloned and sequenced to ensure 
that the amplified sequences represent the sequence of the 
5 desired NHP gene. The PGR fragment can then be used to isolate 
a full length cDNA clone by a variety of methods. For example, 
the amplified fragment can be labeled and used to screen a cDNA 
library, such as a bacteriophage cDNA library. Alternatively, 
the labeled fragment can be used to isolate genomic clones via 
10 the screening of a genomic library. 

PGR technology can also be used to isolate full length 
cDNA sequences. For example, RNA can be isolated, following 
standard procedures, from an appropriate cellular or tissue 
source (i.e., one known, or suspected, to express a NHP gene). 
15 A reverse transcription (RT) reaction can be performed on the 
RNA using an oligonucleotide primer specific for the most 5' 
end of the an?)lified fragment for the priming of first strand 
synthesis. The resulting RNA/DNA hybrid may then be "tailed" 
using a standard terminal transferase reaction, the hybrid may 
20 be digested with RNase H, and second strand synthesis may then 
be primed with a complementary primer. Thus, cDNA sequences 
upstream of the amplified fragment can be isolated. For a 
review of cloning strategies that can be used, see e.g., 
Sambrook et al., 1989, supra. 
25 A cDNA encoding a mutant NHP gene can be isolated, for 

example, by using PGR. In this case, the first cDNA strand may 
be synthesized by hybridizing an oligo-dT oligonucleotide to 
mRNA isolated from tissue known or suspected to be expressed in 
an individual putatively carrying a mutant NHP allele, and by 
30 extending the new strand with reverse transcriptase. The 
second strand of the cDNA is then synthesized using an 
oligonucleotide that hybridizes specifically to the 5' end of 
the normal gene. Using these two primers, the product is then 
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amplified via PGR, optionally cloned into a suitable vector, 
and subjected to DNA sequence analysis through methods well 
known to those of skill in the art. By comparing the DNA 
sequence of the mutant NHP allele to that of a corresponding 
5 normal NHP allele, the mutation (s) responsible for the loss or 
alteration of function of the mutant NHP gene product can be 
ascertained. 

Alternatively, a genomic library can be constructed using 
DNA obtained from an individual suspected of or known to carry 
10 a mutant NHP allele (e.g., a person manifesting a NHP- 

associated phenotype such as, for example, obesity, high blood 
pressure, connective tissue disorders, infertility, etc.), or a 
cDNA library can be constructed using RNA from a tissue known, 
or suspected, to express a mutant NHP allele. A normal NHP 
15 gene, or any suitable fragment thereof, can then be labeled and 
used as a probe to identify the corresponding mutant NHP allele 
in such libraries. Clones containing mutant NHP gene sequences 
can then be purified and subjected to sequence analysis 
according to methods well known to those skilled in the art. 
20 Additionally, an expression library can be constructed 

utilizing cDNA synthesized from, for example, RNA isolated from 
a tissue known, or suspected, to express a mutant NHP allele in 
an individual suspected of or known to carry such a mutant 
allele. In this manner, gene products made by the putatively 
25 mutant tissue can be expressed and screened using standard 
antibody screening techniques in conjunction with antibodies 
raised against a normal NHP product, as described below. (For 
screening techniques, see, for example, Harlow, E. and Lane, 
eds., 1988, "Antibodies: A Laboratory Manual", Cold Spring 
30 Harbor Press, Cold Spring Harbor, NY) . 

Additionally, screening can be accomplished by screening 
with labeled NHP fusion proteins, such as, for example, 
alkaline phosphatase-NHP or NHP-alkaline phosphatase fusion 
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proteins. In cases where a NHP mutation results in an 
expressed gene product with altered function (e.g., as a result 
of a missense or a frameshift mutation), polyclonal antibodies 
to a NHP are likely to cross-react with a corresponding mutant 
5 NHP gene product. Library clones detected via their reaction 
with such labeled antibodies can be purified and subjected to 
sequence analysis according to methods well known in the art. 

The invention also encompasses (a) DNA vectors that 
contain any of the foregoing NHP coding sequences and/or their 
10 complements (i.e., antisense) ; (b) DNA expression vectors that 
contain any of the foregoing NHP coding sequences operatively 
associated with a regulatory element that directs the 
expression of the coding sequences (for example, baculo virus 
as described in U.S. Patent No. 5,869,336 herein incorporated 
15 by reference); (c) genetically engineered host cells that 

contain any of the foregoing NHP coding sequences operatively 
associated with a regulatory element that directs the 
expression of the coding sequences in the host cell; and (d) 
genetically engineered host cells that express an endogenous 
20 NHP gene under the control of an exogenously introduced 

regulatory element (i.e., gene activation). As used herein, 
regulatory elements include, but are not limited to, inducible 
and non-inducible promoters, enhancers, operators and other 
elements known to those skilled in the art that drive and 
25 regulate expression. Such regulatory elements include but are 
not limited to the cytomegalovirus (hCMV) immediate early gene, 
regulatable, viral elements (particularly retroviral LTR 
promoters) , the early or late promoters of SV40 adenovirus, the 
lac system, the trp system, the TAC system,, the TRC system, the 
30 major operator and promoter regions of phage lambda, the 
control regions of fd coat protein, the promoter for 
3-phosphoglycerate kinase (PGK), the promoters of acid 
phosphatase, and the promoters of the yeast a-mating factors. 
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The present invention also encompasses antibodies and 
anti-idiotypic antibodies (including Fab fragments), 
antagonists and agonists of the NHP, as well as compounds or 
nucleotide constructs that inhibit expression of a NHP gene 
5 (transcription factor inhibitors, antisense and ribozyme 
molecules, or gene or regulatory sequence replacement 
constructs), or promote the expression of a NHP (e.g., 
expression constructs in which NHP coding sequences are 
operatively associated with expression control elements such as 
10 promoters, promoter /enhancers, etc.). 

The NHPs or NHP peptides, NHP fusion proteins, NHP 
nucleotide sequences, antibodies, antagonists and agonists can 
be useful for the detection of mutant NHPs or inappropriately 
expressed NHPs for the diagnosis of disease- The NHP proteins 
15 or peptides, NHP fusion proteins, NHP nucleotide sequences, 

host cell expression systems, antibodies, antagonists, agonists 
and genetically engineered cells and animals can be used for 
screening for drugs (or high throughput screening of 
combinatorial libraries) effective in the treatment of the 
20 symptomatic or phenotypic manifestations of perturbing the 

normal function of NHP in the body. The use of engineered host 
cells and/or animals may offer an advantage in that such 
systems allow not only for the identification of compounds that 
bind to the endogenous receptor for an NHP, but can also 
25 identify compounds that trigger NHP-mediated activities or 
pathways - 

Finally, the NHP products can be used as therapeutics. 
For example, soluble derivatives such as NHP peptides /domains 
corresponding to NHPs, NHP fusion protein products (especially 
30 NHP-Ig fusion proteins, i.e., fusions of a NHP, or a domain of 
a NHP, to an IgFc) , NHP antibodies and anti-idiotypic 
antibodies (including Fab fragments), antagonists or agonists 
(including compounds that modulate or act on downstream targets 
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in a NHP-mediat6d pathway) can be used to directly treat 
diseases or disorders. For instance, the administration of an 
effective amount of soluble NHP, or a NHP-IgFc fusion protein 
or an anti-idiotypic antibody (or its Fab) that mimics the NHP 
could activate or effectively antagonize the endogenous NHP 
receptor. Nucleotide constructs encoding such NHP products can 
be used to genetically engineer host cells to express such 
products in vivo; these genetically engineered cells function 
as "bioreactors" in the body delivering a continuous supply of 
a NHP, a NHP peptide, or a NHP fusion protein to the body. 
Nucleotide constructs encoding functional NHPs, mutant NHPs, as 
well as antisense and ribozyme molecules can also be used in 
"gene therapy" approaches for the modulation of NHP expression. 
Thus, the invention also encompasses pharmaceutical 
formulations and methods for treating biological disorders. 

Various aspects of the invention are described in greater 
detail in the subsections below. 

5.1 THE NHP SEQUENCES 
The cDNA sequences and the corresponding deduced amino 
acid sequences of the described NHPs are presented in the 
Sequence Listing. The NHP nucleotides were obtained from 
clustered human gene trapped sequences, testis and mammary 
transcript RACE products, ESTs, and human brain, testis, 
trachea, pituitary, thymus, and mammary gland cDNA libraries 
(Edge Biosystems, Gaithersburg, MD) . 

SEQ ID N0S:l-5 describe sequences that are similar to 
eucaryotic ATP-driven ion pumps such as calcium transporting 
ATPases, and which can be found expressed in a variety of human 
I cells and tissues. The described sequences were assembled 
using gene trapped sequences and clones isolated from human 
kidney, lymph node, and thymus cDNA libraries (Edge Biosystems, 
Gaithersburg, MD) . 
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SEQ ID NOS: 6-8 describe sequences that are similar to, 
inter alia, sulfate transporter and cotransporter proteins, and 
can be found expressed in human bone marrow and testis, 
several polymorphisms were found in this NHP including, but not 

5 limited to, possible A-to-G transitions at nucleotide positions 
corresponding to nucleotides 589, 692, 917, 1,164, and 2,390 
of, for example SEQ ID NO: 8 which be silent or can result in 
the met corresponding to amino acid position 73 of SEQ ID N0:7 
converting to a val (e.g., met 73 converting to val 73), val 
10 148 converting to ile, asn 230 converting to lys, ile 562 
converting to val. An additional C-to-T transition was 
identified that converts ala 777 to val. SEQ ID NOS: 6-8 can be 
expressed in bone marrow and predominantly in testis cells. 
These NHPs were assembled from gene trapped sequences and 

15 clones from a human testis cDNA library (Edge Biosystems, 

Gaithersburg, MD) . 

SEQ ID NOS: 9-11 describe sequences that are similar to 
zinc transporters and vesicular transporters, can be found 
expressed in, inter alia, placenta and adrenal gland, and these 
20 NHP sequences were assembled using gene trapped sequences and 
clones from human adrenal and placenta cDNA libraries (Edge 
Biosystems, Gaithersburg, MD) . 

Transporters and transporter related multidrug resistance 
(MDR) sequences, as well as uses and applications that are 
25 germane to the described NHPs, are described in U.S. Patents 
NOS. 5,198,344 and 5,866,699 which are herein incorporated by 
reference in their entirety. 

5.2 NHPS AND NHP POLYPEPTIDES 
30 NHPs, polypeptides, peptide fragments, mutated, truncated, 

or deleted forms of the NHPs, and/or NHP fusion proteins can be 
prepared for a variety of uses. These uses include but are not 
limited to the generation of antibodies, as reagents in 
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diagnostic assays, the identification of other cellular gene 
products related to a HHP, as reagents in assays for screening 
for compounds that can be as pharmaceutical reagents useful in 
the therapeutic treatment of mental, biological, or medical 
5 disorders and diseases. Given the similarity information and 
expression data, the described HHPs can be targeted (by drugs, 
oligos, antibodies, etc,) in order to treat disease, or to 
therapeutically augment the efficacy of, for example, 
chemotherapeutic agents used in the treatment of breast or 

10 prostate cancer. 

The Sequence Listing discloses the amino acid sequences 
encoded by the described NHP polynucleotides. The NHPs 
typically display have initiator methionines in DNA sequence 
contexts consistent with a translation initiation site. 
15 The NHP amino acid sequences of the invention include the 

amino acid sequence presented in the Sequence Listing as well 
as analogues and derivatives thereof. Further, corresponding 
NHP homologues from other species are encompassed by the 
invention. In fact, any NHP protein encoded by the NHP 
20 nucleotide sequences described above are within the scope of 
the invention, as are any novel polynucleotide sequences 
encoding all or any novel portion of an amino acid sequence 
presented in the Sequence Listing. The degenerate nature of 
the genetic code is well known, and, accordingly, each amino 
25 acid presented in the Sequence Listing, is generically 

representative of the well known nucleic acid ^triplet" codon, 
or in many cases codons, that can encode the amino acid. As 
such, as contemplated herein, the amino acid sequences 
presented in the Sequence Listing, when taken together with the 
30 genetic code (see, for example. Table 4-1 at page 109 of 
^"Molecular Cell Biology", 1986, J. Darnell et al. eds.. 
Scientific American Books, New York, NY, herein incorporated by 
reference) are generically representative of all the various 
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permutations and coinbinations of nucleic acid sequences that 
can encode such amino acid sequences. 

The invention also encompasses proteins that are 
functionally equivalent to the HHPs encoded by the presently 
described nucleotide sequences as judged by any of a number of 
criteria, including, but not limited to, the ability to bind 
and cleave a substrate of a NHP, or the ability to effect an 
identical or complementary downstream pathway, or a change in 
cellular metabolism (e.g., proteolytic activity, ion flux, 
tyrosine phosphorylation, etc.). Such functionally equivalent 
NHP proteins include, but are not limited to, additions or 
substitutions of amino acid residues within the amino acid 
sequence encoded by the NHP nucleotide sequences described 
above, but which result in a silent change, thus producing a 
15 functionally equivalent gene product. Amino acid substitutions 
may be made on the basis of similarity in polarity, charge, 
solubility, hydrophobicity, hydrophilicity, and/or the 
amphipathic nature of the residues involved. For example, 
nonpolar (hydrophobic) amino acids include alanine, leucine, 
20 isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine; polar neutral amino acids include glycine, serine, 
threonine, cysteine, tyrosine, asparagine, and glut amine; 
positively charged (basic) amino acids include arginine, 
lysine, and histidine; and negatively charged (acidic) amino 
25 acids include aspartic acid and glutamic acid. 

A variety of host-expression vector systems can be used to 
express the NHP nucleotide sequences of the invention. Where, 
as in the present instance, the NHP peptide or polypeptide is 
thought to be membrane protein, the hydrophobic regions of the 
30 protein can be excised and the resulting soluble peptide or 
polypeptide can be recovered from the culture media. Such 
expression systems also encompass engineered host cells that 
express a NHP, or functional equivalent, in situ. Purification 
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or enrichment of a NHP from such expression systems can be 
accomplished using appropriate detergents and lipid micelles 
and methods well known to those skilled in the art. However, 
such engineered host cells themselves may be used in situations 
5 where it is important not only to retain the structural and 
functional characteristics of the NHP, but to assess biological 
activity, e.g., in drug screening assays. 

The expression systems that may be used for purposes of 
the invention include but are not limited to microorganisms 
10 such as bacteria (e.g., E. coli, B. svbtilis) transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing NHP nucleotide sequences; yeast 
(e.g., Saccharomyces, Pichia) transformed with recombinant 
yeast expression vectors containing NHP nucleotide sequences; 
15 insect cell systems infected with recombinant virus expression 
vectors (e.g., baculovirus) containing NHP sequences; plant 
cell systems infected with recombinant virus expression vectors 
(e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, 
TMV) or transformed with recombinant plasmid expression vectors 
20 (e.g., Ti plasmid) containing NHP nucleotide sequences; or 
mammalian cell systems (e.g., COS, CHO, BHK, 293, 3T3) 
harboring recombinant expression constructs containing 
promoters derived from the genome of mammalian cells (e.g., 
metallothionein promoter) or from mammalian viruses (e.g., the 
25 adenovirus late promoter; the. vaccinia virus 7.5K promoter). 

In bacterial systems, a number of expression vectors may 
be advantageously selected depending upon the use intended for 
the NHP product being expressed. For example, when a large 
quantity of such a protein is to be produced for the generation 
30 of pharmaceutical compositions of or containing NHP, or for 

raising antibodies to a NHP, vectors that direct the expression 
of high levels of fusion protein products that are readily 
purified may be desirable. Such vectors include, but are not 

20 
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limited, to the E. coli expression vector paR278 (Ruther et 
al., 1983, EMBO J. 2:1791), in which a NHP coding sequence may 
be ligated individually into the vector in frame with the lacZ 
coding region so that a fusion protein is produced; pIN vectors 
(Inouye & Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van 
Heeke & Schuster, 1989, J. Biol. Chem. 264:5503-5509); and the 
like. pGEX vectors (Pharmacia or American Type Culture 
Collection) can also be used to express foreign polypeptides as 
fusion proteins with glutathione S-transf erase (GST) . In 
general, such fusion proteins are soluble and can easily be 
purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. 
The PGEX vectors are designed to include thrombin or factor Xa 
protease cleavage sites so that the cloned target gene product 
can be released from the GST moiety. 

In an insect system, Autographa californica nuclear 
polyhidrosis virus (AcNPV) is used as a vector to express 
foreign genes. The virus grows in Spodoptera frugiperda cells. 
A NHP coding sequence may be cloned individually into non- 
essential regions (for example the polyhedrin gene) of the 
virus and placed Under control of an AcNPV promoter (for 
example the polyhedrin promoter) . Successful insertion of NHP 
coding sequence will result in inactivation of the polyhedrin 
gene and production of non-occluded recombinant virus (i.e., 
virus lacking the proteinaceous coat coded for by the 
polyhedrin gene) . These recombinant viruses are then used to 
infect Spodoptera frugiperda cells in which the inserted 
sequence is expressed (e.g., see Smith et al., 1983, J. Virol. 
46:584; Smith, U.S. Patent No. 4,215,051). 

In mammalian host cells, a number of viral-based 
expression systems may be utilized. In cases where an 
adenovirus is used as an expression vector, the NHP nucleotide 
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sequence of interest may be ligated to an adenovirus 
transcription/translation control complex, e.g., the late 
promoter and tripartite leader sequence. This chimeric gene 
may then be inserted in the adenovirus genome by in vitro or in 
5 vivo recombination. Insertion in a non-essential region of the 
viral genome (e.g., region El or E3) will result in a 
recombinant virus that is viable and capable of expressing a 
NHP product in infected hosts (e.g., See Logan & Shenk, 1984, 
Proc. Natl. Acad. Sci. USA 81:3655-3659). Specific initiation 
10 signals may also be required for efficient. translation of 

inserted NHTnucleotide sequences. These signals include the 
ATG initiation codon and adjacent sequences. In cases where an 
entire NHP gene or cDNA, including its own initiation codon and 
adjacenti sequences, is inserted into the appropriate expression 
15 vector, no additional translational control signals may be 

needed. However, in cases where only a portion of a NHP coding 
sequence is inserted, exogenous translational control signals, 
including, perhaps, the ATG initiation codon, must be provided. 
Furthermore, the initiation codon must be in phase with the 
20 reading frame of the desired coding sequence to ensure 
translation of the entire insert. These exogenous 
translational control signals and initiation codons can be of a 
variety of origins, both natural and synthetic. The efficiency 
of expression may be enhanced by the inclusion of appropriate 
25 transcription enhancer elements, transcription terminators, 
etc. (See Bittner at al., 1987, Methods in Enzymol. 153:516- 
544) . 

in addition, a host cell strain may be chosen that 
modulates the expression of the inserted sequences, or modifies 
30 and processes the gene product in the specific fashion desired. 
Such modifications (e.g., glycosylation) and processing (e.g., 
cleavage) of protein products may be important for the function 
of the protein. Different host cells have characteristic and 
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specific mechanisms for the post-translational processing and 
modification of proteins and gene products. Appropriate cell 
lines or host systems can be chosen to ensure the correct 
modification and processing of the foreign protein expressed. 
5 TO this end, eukaryotic host cells which possess the cellular 
machinery for proper processing of the primary transcript, 
glycosylation, and phosphorylation of the gene product may be 
used, such mammalian host cells include, but are not limited 
to, CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and in 
10 particular, human cell lines. 

For long-term, high-yield production of recombinant 
proteins, stable expression is preferred. For example, cell 
lines which stably express the NHP sequences described above 
can be engineered. Rather than using expression vectors which 
15 contain viral origins of replication, host cells can be 
transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer sequences, 
transcription terminators, polyadenylation sites, etc.), and a 
selectable marker. Following the introduction of the foreign 
20 DNA, engineered cells may be allowed to grow for 1-2 days in an 
enriched media, and then are switched to a selective media. 
The selectable marker in the recombinant plasmid confers 
resistance to the selection and allows cells to stably 
integrate the plasmid into their chromosomes and grow to form 
25 foci which in turn can be cloned and expanded into cell lines. 
This method may advantageously be used to engineer cell lines 
which express the NHP product. Such engineered cell lines may 
be particularly useful in screening and evaluation of compounds 
that affect the endogenous activity of the NHP product. 
30 A number of selection systems may be used, including but 

not limited to the herpes simplex virus thymidine kinase 
(Wigler, et al., 1977, Cell 11:223), hypoxanthine-guanine 
phosphoribosyltransferase (Szybalska t Szybalski, 1962, Proc. 
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Natl. Acad. Sci. USA 48:2026), and adenine 
phosphoribosyltransferase (Lowy, at al., 1980, Cell 22:817) 
genes can be employed in tk", hgprt' or aprt" cells, 
respectively. Also, antimetabolite resistance can be used as 
5 the basis of selection for the following genes: dhfr, which 
confers resistance to methotrexate (Wigler, et al., 1980, Natl. 
Acad. Sci. USA 77:3567; O'Hare, et al., 1981, Proc. Natl. Acad. 
Sci. USA 75:1527); gpt, which confers resistance to 
mycophenolic acid (Mulligan fi Berg, 1981, Proc. Natl. Acad. 
10 Sci. USA 78:2072); neo, which . confers resistance to the 

aminoglycoside G-418 (Colberre-Garapin, et al., 1981, J. Mol. 
Biol. 150:1); and hygrb, which confers resistance to hygromycin 
(Santerre, et al., 1984, Gene 30:147). 

Alternatively, any fusion protein can be readily purified 
15 by utilizing an antibody specific for the fusion protein being 
expressed. For example, a system described by Janlcnecht et al. 
allows for the ready purification of non-denatured fusion 
proteins expressed in human cell lines (Janknecht, et al., 
1991, Proc. Natl. Acad. Sci. USA 88:8972-8976). In this 
20 system, the gene of interest is subcloned into a vaccinia 

recombination plasmid such that the gene's open reading frame 
is translationally fused to an amino-terminal tag consisting of 
six histidine residues. Extracts from cells infected with 
recombinant vaccinia virus are loaded onto Ni^*-nitriloacetic 
25 acid-agarose columns and histidine-tagged proteins are 
selectively eluted with imidazole-containing buffers. 

Also encompassed by the present invention are fusion 
proteins that direct the NHP to a target organ and/or 
facilitate transport across the membrane into the cytosol. 
30 Conjugation of NHPs to antibody molecules or their Fab 

fragments could be used to target cells bearing a particular 
epitope. Attaching the appropriate signal sequence to the NHP 
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would also transport the NHP to the desired location within the 
cell. Alternatively targeting of NHP or its nucleic acid 
sequence might be achieved using liposome or lipid complex 
based delivery systems. Such technologies are described in 
T^. p.c..n....A Annroach, New,RRC ed., Oxford University 

Press, New York and in U.S. Patents Nos. 4,594,595, 5,459,127, 
5,948,767 and 6,110,490 and their respective disclosures which 
are herein incorporated by reference in their entirety. 
Additionally embodied are novel protein constructs engineered 
in such a way that they facilitate transport of the NHP to the 
target site or desired organ. This goal may be achieved by 
coupling of the NHP to a cytokine or other ligand that provides 
targeting specificity, and/or to a protein transducing domain 
(see generally U.S. applications Ser. No. 60/111,701 and 
60/056,713, both of which are herein incorporated by reference, 
for examples of such transducing sequences) to facilitate 
passage across cellular membranes if needed and can optionally 
be engineered to include nuclear localization sequences when 
desired. 

5.3 ANTIBODIES TO NHP PRODUCTS 
Antibodies that specifically recognize one or more 
epitopes of a NHP, or epitopes of conserved variants of a NHP, 
or peptide fragments of a NHP are also encompassed by the 
invention. Such antibodies include but are not limited to 
polyclonal antibodies, monoclonal antibodies (mAbs), humanized 
or chimeric antibodies, single chain antibodies. Fab fragments, 
F(ab')2 fragments, fragments produced by a Fab expression 
library, anti-idiotypic (anti-Id) antibodies, and epitope- 
binding fragments of any of the above. 

The antibodies of the invention may be used, for example, 
in the detection of NHP in a biological sample and may, 
therefore, be utilized as part of a diagnostic or prognostic 
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technique whereby patients may be tested for abnormal amounts 
of NHP. such antibodies may also be utilized in conjunction 
with, for example, compound screening schemes for the 
evaluation of the effect of test compounds on expression and/or 
5 activity of a NHP gene product. Additionally, such antibodxes 
can be used in conjunction gene therapy to, for example, 
evaluate the normal and/or engineered NHP-expressing cells 
prior to their introduction into the patient. Such antibodies 
may additionally be used as a method for the inhibition of 
10 abnormal NHP activity. Thus, such antibodies may, therefore, 
be utilized as part of treatment methods. 

For the production of . antibodies , various host animals may 
be immunized by injection with a NHP, an NHP peptide (e.g., one 
corresponding to a functional domain of an NHP) , truncated NHP 
15 polypeptides (NHP in which one or more domains have been 

deleted) , functional equivalents of the NHP or mutated varxant 
of the NHP. Such host animals may include but are not limited 
to pigs, rabbits, mice, goats, and rats, to name but a few. 
various adjuvants may be used to increase the immunological 
20 response, depending on the host species, including but not 

limited to Freund's adjuvant (complete and incomplete), mineral 
salts such as aluminum hydroxide or aluminum phosphate, surface 
active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, and potentially useful 
25 human adjuvants such as BCG (bacille Calmette-Guerin) and 
CorynebacteriuH. parvum. Alternatively, the immune response 
coold be enhanced by combination and or coupling with molecules 
such as keyhole limpet hemocyanin, tetanus toxoid, diptheria 
toxoid, ovalbumin, cholera toxin or fragments thereof. 
30 Polyclonal antibodies are heterogeneous populations of antibody 
molecules derived from the sera of the iimnunized animals. 

Monoclonal antibodies, which are homogeneous populations 
of antibodies to a particular antigen, can be obtained by any 
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technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These include, 
but are not limited to, the hybridoma technique of Kohler and 
Milstein, (1975, Nature 256:495-497; and O.S. Patent No. 
5 4,376,110), the human B-cell hybridoma technique (Kosbor et 
al., 1983, Immunology Today 4:72; Cole et al., 1983, Proc. 
Natl. Acad. Sci. USA 80:2026-2030), and the EBV-hybridoma 
technique (Cole et al., 1985, Monoclonal Antibodies And Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may 
10 be of any immunoglobulin class including IgG, IgM, IgE, IgA, 
IgD and any'lubclass thereof. The hybridoma producing the mAb 
of this invention may be cultivated in vitro or in vivo. 
Production of high titers of mAbs in vivo makes this the 
presently preferred method of production. ' 
15 in addition, techniques developed for the production of 

"chimeric antibodies" (Morrison et al., 1984, Proc. Natl. Acad. 
Sci., 81:6851-6855; Neuberger et al., 1984, Nature, 312:604- 
608; Takeda et al., 1985, Nature, 514:452-454) by splicing the 
genes from a mouse antibody molecule of appropriate antigen 
20 specificity together with genes from a human antibody molecule 
of appropriate biological activity can be used. A chimeric 
antibody is a molecule in which different portions are derived 
from different animal species, such as those having a variable 
region derived from a murine mAb and a human immunoglobulin 
25 constant region. Such technologies are described in U.S. 
Patents Nos. 6,075,181 and 5,877,397 and their respective 
disclosures which are herein incorporated by reference in their 
entirety. Also encompassed by the present invention is the use 
of fully humanized monoclonal antibodies as described in US 
30 Patent No. 6,150,584 and respective disclosures which are 
herein incorporated by reference in their entirety. 
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Alternatively, techniques described for the production of 
single chain antibodies (U.S. Patent 4,946,778; Bird, 1988, 
science 242:423-426; Huston et al., 1988, Proc. Natl. Acad. 
Sci. USA 85:5879-5883; and Ward at al., 1989, Nature 334:544- 
5 546) can be adapted to produce single chain antibodies against 
NHP gene products. Single chain antibodies are formed by 
linking the heavy and light chain fragments of the Fv region 
via an amino acid bridge, resulting in a single chain 
polypeptide. 

10 Antibody fragments which recognize specific epitopes may 

be generated by known techniques. For example, such fragments 
include, but are not limited to: the F(ab'). fragments which can 
be produced by pepsin digestion of the antibody molecule and 
the Fab fragments which can be generated by reducing the 
15 disulfide bridges of the F(ab'). fragments. Alternatively, Fab 
expression libraries may be constructed (Huse et al., 1989, 
science, 245:1275-1281) to allow rapid and easy identification 
of monoclonal Fab fragments with the desired specificity. 

Antibodies to a NHP can, in turn, be utilized to generate 
20 anti-idiotype antibodies that "mimic" a given NHP, using 

techniques well known to those skilled in the art. (See, e.g., 
Greenspan & Bona, 1993, FASEB J 7f5; :437-444; and Nissinoff, 
1991, J. Immunol. 147^:2429-2438). For example antibodies 
which bind to a NHP domain and competitively inhibit the 
25 binding of NHP to its cognate receptor can be used to generate 
anti-idiotypes that "mimic" the NHP and, therefore, bind and 
activate or neutralize a receptor. Such anti-idiotypic 
antibodies or Fab fragments of such anti-idiotypes can be used 
in therapeutic regimens involving a NHP mediated pathway. 
30 The present invention is not to be limited in scope by the 

specific embodiments described herein, which are intended as 
single illustrations of individual aspects of the invention, 
and functionally equivalent methods and components are within 
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the scope of the invention. Indeed, various modifications of 
the invention, in addition to those shown and described herein 
will become apparent to those skilled in the art from the 
foregoing description. Such modifications are intended to fall 
within the scope of the appended claims. All cited 
publications, patents, and patent applications are herein 
incorporated by reference in their entirety. 
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WHf^T •^'^ rT.aTMF.D IS: 

1. An isolated nucleic acid molecule comprising at 
least 24 contiguous bases of nucleotide sequence first 

5 disclosed in SEQ ID NO: 1. 

2. An isolated nucleic acid molecule comprising a 

nucleotide sequence that: 

(a) encodes the amino acid sequence shown in SEQ ID 

10 NO: 2; and 

(b) hybridizes under stringent conditions to the 
nucleotide sequence of SEQ ID N0:1 or the 
complement thereof. 

3. An isolated nucleic acid molecule comprising a 
nucleotide sequence that encodes the amino acid sequence shown 
in SEQ ID NO: 2. 

4. An isolated nucleic acid molecule comprising a 

20 nucleotide sequence that: 

(a) encodes the amino acid sequence shown in SEQ ID 

NO: 7; and 

(b) hybridizes under stringent conditions to the 
nucleotide sequence of SEQ ID NO: 6 or the 

25 complement thereof. 

5. An isolated nucleic acid molecule comprising a 
nucleotide sequence that encodes the amino acid sequence shown 
in SEQ ID NO: 7. 

30 

6. An isolated nucleic acid molecule comprising a 
nucleotide sequence that: 



30 



WO01/64S78 FCT/USOl/06462 

(a) encodes the amino acid sequence shown in SEQ ID 

NO: 10; and 

(b) hybridizes under stringent conditions to the 
nucleotide sequence of SEQ ID NO: 9 or the 
complement thereof. 

7. . An isolated nucleic acid molecule comprising a 
nucleotide sequence that encodes the amino acid sequence shown 
in SEQ ID NO: 10. 
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SEQOENCB LISTIHG 
<H0> LEXICON GENETICS INCORPORATED 

<120> Novel Human Transporter Proteins and Polynucleotides Encoding the 



Same 

<130> LEX-0141-PCT 

<150> US 60/185,956 
<151> 2000-02-29 

<160> 11 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 3534 

<212> DNA 

<213> homo sapiens 

<400> 1 



<400> 1 

atgtggcgct ggatccggca gcagctgggt tttgacccac cacatcagag tgacacaaga 
accatctacg tagccaacag gtttcctcag aatggccttt acacacctca ^^aatttata 
gataacagga tcatttcatc taagtacact gtgtggaatt ttgttccaaa ^aatttattt 
aaacaattca gaagagtggc aaacttttat tttcttatta tatttttggt tcagcttatg 
aSStaca^ ctaccagJcc agttaccagt ggacttccat tattctttgt gataacagta 
actgccataa agcagggata tgaagattgg ttacggcata actcagataa tgaagtaaat 
ggagctcctg tttatgttgt tcgaagtggt ggccttgtaa aaactagatc aaaaaacatt 
cggqtgggtg atattgttcg aatagccaaa gatgaaattt ttcctgcaga cttggtgctt 
ct^cctcag atcgactgga tggttcctgt cacgttacaa ctgctagttt ggacggagaa 
actaacctga agacacatgt ggcagttcca gaaacagcat tattacaaac agttgccaat 
ttggacactc tagtagctgt aatagaatgc cagcaaccag aagcagactt atacagattc 
atgggacgaa tgatcataac ccaacaaatg gaagaaattg taagacctct ggggccggag 
agtctcctgc ttcgtggagc cagattaaaa aacacaaaag aaatttttgg tgttgcggta 
tacactggaa tggaaactaa gatggcatta aattacaaga gcaaatcaca gaaacgatct 
gcagtag^^^ agtcaatgaa tacatttttg ataatttatc tagtaattct tatatctgaa 
gctgtcatca gcactatctt gaagtataca tggcaagctg aagaaaaatg ggatgaacct 
tggtataacc aaaaaacaga acatcaaaga aatagcagta agattctgag atttatttca 
gacttccttg cttttttggt tctctacaat ttcatcattc caatttcatt ^tatgtgaca 
gtcgaaatgc agaaatttct tggatcattt tttattggct gggatcttga tctgtatcat 
gaagaatcag atcagaaagc tcaagtcaat acttccgatc tgaatgaaga gcttggacag 
gtagagtacg tgtttacaga taaaactggt acactgacag aaaatgagat gcagtttcgg 
gaatgttcaa ttaatggcat gaaataccaa gaaattaatg gtagacttgt acccgaagga 
ccaacaccag actcttcaga aggaaactta tcttatctta gtagtttatc ccatcttaac 
aacttatccc atcttacaac cagttcctct ttcagaacca gtcctgaaaa tgaaactgaa 
ctaattaaag aacatgatct cttctttaaa gcagtcagtc tctgtcacac tgtacagatt 
agcaatgttc aaactgactg cactggtgat ggtccctggc aatccaacct ggx:accatcg 
cagttggagt actatgcatc ttcaccagat gaaaaggctc tagtagaagc tgctgcaagg 
attggtattg tgtttattgg caattctgaa gaaactatgg aggttaaaac tcttggaaaa 
ctggaacggt acaaactgct tcatattctg gaatttgatt cagatcgtag gagaatgagt 
gtaattgttc aggcaccttc aggtgagaag ttattatttg ctaaaggagc tgagtcatca 
attctcccta aatgtatagg tggagaaata gaaaaaacca gaattcatgt agatgaattt 
gctttgaaag ggctaagaac tctgtgtata gcatatagaa aatttacatc aaaagagtat 
gaggaaatag ataaacgcat atttgaagcc aggactgcct tgcagcagcg ggaagagaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 



1/16 



wo 01/64878 



PCT/OSOl/06462 



ttggcagctg ttttccagtt catagagaaa gacctgatat tacttggagc cacagcagta 
gaagacagac tacaagataa agttcgagaa actattgaag cattgagaat ggctggtatc 
aaagtatggg tacttactgg ggataaacat gaaacagctg ttagtgtgag tttatcatgt 
ggccattttc atagaaccat gaacatcctt gaacttataa accagaaatc agacagcgag 
tgtgctgaac aattgaggca gcttgccaga agaattacag aggatcatgt gattcagcat 
gggctggtag tggatgggac cagcctatct cttgcactca gggagcatga aaaactattt 
atggaagttt gcagaaattg ttcagctgta ttatgctgtc gtatggctcc actgcagaaa 
qcaaaagtaa taagactaat aaaaatatca cctgagaaac ctataacatt ggctgttggt 
gatggtgcta atgacgtaag catgatacaa gaagcccatg ttggcatagg aatcatgggt 
aaagaaggaa gacaggctgc aagaaacagt gactatgcaa tagccagatt taagttcctc 
tccaaattgc tttttgttca tggtcatttt tattatatta gaatagctac ccttgtacag 
tatttttttt ataagaatgt gtgctttatc acaccccagt ttttatatca ^ttctactgt 
ttgttttctc agcaaacatt gtatgacagc gtgtacctga ctttatacaa tatttgtttt 
acttccctac ctattctgat atatagtctt ttggaacagc atgtagaccc tcatgtgtta 
caaaataagc ccacccttta tcgagacatt agtaaaaacc gcctcttaag tattaaaaca 
tttctttatt ggaccatcct gggcttcagt catgccttta ttttcttttt tggatcctat 
ttactaatag ggaaagatac atctctgctt ggaaatggcc agatgttygg aaactggaca 
tttggcactt tggtcttcac agtcatggtt attacagtca cagtaaagat ggctctggaa 
actcattttt ggacttggat caaccatctc gttacctggg gatctattat attttatttt 
gtattttcct tgttttatgg agggattctc tggccatttt tgggctccca gaatatgtat 
tttgtgttta ttcagctcct gtcaagtggt tctgcttggt ttgccataat cctcatggtt 
gttacatgtc tatttcttga tatcataaag aaggtctttg accgacacct ccaccctaca 
agtactgaaa aggcacagct tactgaaaca aatgcaggta tcaagtgctt ggactccatg 
tgctgtttcc cggaaggaga agcagcgtgt gcatctgttg gaagaatgct ggaacgagtt 
ataggaagat gtagtccaac ccacatcagc agatcatgga gtgcatcgga tcctttctat 
accaacgaca ggagcatctt gactctctcc acaatggact catctacttg ttaa 

<210> 2 
<211> 1177 
<212> PRT 

<213> homo sapiens 

llt'H Arg Trp He Arg Gin Gin Leu Gly Phe Asp Pro Pro His Gin 

1 5 10 

Ser ASP Thr Arg Thr He Tyr Val Ala Asn Arg Phe Pro Gin Asn Gly 

20 25 ^° „ . 

Leu Tyr Thr Pro Gin Lys Phe He Asp Asn Arg He He Ser Ser Lys 

35 40 45 

Tyr Thr Val Trp Asn Phe Val Pro Lys Asn Leu Phe Glu Gin Phe Arg 

50 55 60 

Arg val Ala Asn Phe Tyr Phe Leu He He Phe Leu Val Gin Leu Met 
65 "^0 75 

He Asp Thr Pro Thr Ser Pro Val Thr Ser Gly Leu Pro Leu Phe Phe 

85 .90 95 

Val He Thr Val Thr Ala He Lys Gin Gly Tyr Glu Asp Trp Leu Arg 

100 105 110 

His Asn ser Asp Asn Glu Val Asn Gly Ala Pro Val Tyr Val Val Arg 

115 120 125 

Ser Gly Gly Leu Val Lys Thr Arg Ser Lys Asn He Arg Val Gly Asp 

130 135 140 

He Val Arg He Ala Lys Asp Glu He Phe Pro Ala Asp Leu Val Leu 
145 150 155 

Leu Ser Ser Asp Arg Leu Asp Gly Ser Cys His Val Thr Thr Ala Ser 

165 170 1" 

Leu Asp Gly Glu Thr Asn Leu Lys Thr His Val Ala Val Pro Glu Thr 
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180 185 
Ala Leu Leu Gin Thr Val Ala Asn Leu Asp Thr Leu Val Ala Val He 

195 200 205 

Glu cys Gin Gin Pro Glu Ala Asp Leu Tyr Arg Phe Met Gly Arg Met 

91 n 215 220 

lie lie Thr Gin Gin Met Glu Glu He Val Arg Pro Leu Gly Pro Glu 

230 23b 
Ser Leu Leu Leu Arg Gly Ala Arg Leu Lys Asn Thr Lys Glu lie Phe 

245 250 
Gly val Ala Val Tyr Thr Gly Met Glu Thr Lys Met Ala Leu Asn Tyr 

260 265 270 

Lys ser Lys Ser Gin Lys Arg Ser Ala Val Glu Lys Ser Met Asn Thr 

Phe Leu lie He Tyr Leu Val ite Leu He Ser Glu Ala Val He Ser 

290 295 300 

Thr lie Leu Lys Tyr Thr Trp Gin Ala Glu Glu Lys Trp Asp Glu Pro 
305 310 315 ^, i'" 

Trp Tyr Asn Gin Lys Thr Glu His Gin Arg Asn Ser Ser Lys lie Leu 

325 330 
Arg Phe He Ser Asp Phe Leu Ala Phe Leu Val Leu Tyr Asn Phe He 
340 

He Pro He Ser Leu Tyr Val Thr Val Glu Met Gin Lys Phe Leu Gly 

355 360 365 

ser Phe Phe He Gly Trp Asp Leu Asp Leu Tyr His Glu Glu Ser Asp 

oTn 375 
Gin Lys Ala Gin val Asn Thr Ser Asp Leu Asn Glu Glu Leu Gly Gin 

390 395 
val Glu Tyr Val Phe Thr Asp Lys Thr Gly Thr Leu Thr Glu Asn Glu 

Met Gin Phe Arg gJu Cys Ser He Asn Gly Met Lys Tyr Gin Glu He 

420 ^25 
Asn Gly Arg Leu Val Pro Glu Gly Pro Thr Pro Asp Ser Ser Glu Gly 

435 

Asn Leu ser Tyr Leu Ser Ser Leu Ser His Leu Asn Asn Leu Ser His 

4 55 4 du 

Leu m Thr ser Ser Ser Phe Arg Thr Ser Pro Glu Asn Glu Thr Glu 
ACS 470 475 

leu He Lys Glu His Asp Leu Phe Phe Lys Ala Val Ser Leu Cys H.s 

485 490 
Thr val Gin He Ser Asn Val Gin Thr Asp Cys Thr Gly Asp Gly Pro 

500 505 
Trp Gin Ser Asn Leu Ala Pro Ser Gin Leu Glu Tyr Tyr Ala Ser Ser 

515 520 525 

Pro ASP Glu Lys Ala Leu Val Glu Ala Ala Ala Arg He Gly He Val 

530 535 540 

Phe He Gly Asn Ser Glu Glu Thr Met Glu Val Lys Thr Leu Gly Lys 
545 550 555 

Leu Glu Arg Tyr Lys Leu Leu His He Leu Glu Phe Asp Ser Asp Arg 

565 570 =>'^ 

Arg Arg Met Ser Val He Val Gin Ala Pro Ser Gly Glu Lys Leu Leu 

580 585 590 

Phe Ala Lys Gly Ala Glu Ser Ser He Leu Pro Lys Cys He Gly Gly 

595 600 605 

Glu He Glu Lys Thr Arg He His Val Asp Glu Phe Ala Leu Lys Gly 

610 615 "° , ^ 

Leu Arg Thr Leu Cys He Ala Tyr Arg Lys Phe Thr Ser Lys Glu Tyr 
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Su Glu ne ASP Lys Zl lie Phe Glu Ala Arg Thr Ala Leu Gin Gin 



645 



630 "5 640 

■ri A DViA Cln ..^^ _ 

650 655 



Arg Glu Glu Lys Leu Ala Ala Val Phe Gin Phe He Glu Lys Asp Leu 

665 



He Leu Leu g!5 Ala Thr Ala Val Glu Asp Arg Leu Gin Asp Lys Val 



Arg Glu Sr He Glu Ala Leu Met Ala Gly lie Lys Val Trp Val 



680 



685 



Leu 5hr Gly Asp Lys His III Thr Ala Val Ser Val Ser Leu Ser Cys 



695 700 



G?y His Phe His Arg T^l Met Asn He Leu Glu Leu He Asn Gin Lys 



710 



715 



725 



730 



ser ASP ser Glu Cys Ala Glu Gin Leu Arg Gin Leu Ala Arg Arg He 

740 

Thr Glu ASP His Val He Gin His Gly Leu Val Val Asp Gly Thr Ser 

760 



Leu ser leu Ala Leu Arg Glu hI^ Glu Lys Leu Phe Met Glu Val Cys 



Arg Iln Cys Ser Ala Val Leil Cys Cys Arg Met Ala Pro Leu Gin Lys 



775 



780 



Tia Lys val He Arg Leu He Lys He Ser Pro Glu Lys Pro He Thr 



790 



795 800 



805 



810 SIS 



Leu Ala val Gly Asp Gly Ala Asn Asp Val Ser Met He Gin Glu Ala 

820 825 
His val Gly He Gly He Met Gly Lys Glu Gly Arg Gin Ala Ala Arg 

835 840 
Asn ser Asp Tyr Ala He Ala Arg Phe Lys Phe Leu Ser Lys Leu Leu 



855 



Phe val His Gly His Phe Tyr Tyr He Arg lie Ala Thr Leu Val Gin 
cc, 870 oId owv 

III Phe Phe Tyr Lys Asn Val Cys Phe He Thr Pro Gin Phe Leu Tyr 

885 

Gin Phe Tyr Cys Leu Phe Ser Gin Gin Thr Leu Tyr Asp Ser Val Tyr 
900 

Leu Thr Leu Tyr Asn He Cys Phe Thr Ser.Leu Pro He Leu He Tyr 

915 920 '25 

ser Leu Leu Glu Gin His Val Asp Pro His Val Leu Gin Asn Lys Pro 

930 935 
Thr Leu Tyr Arg Asp He Ser Lys Asn Arg Leu Leu Ser He Lys Thr 

c 950 

pie Leu Tyr Trp Thr He Leu Gly Phe Ser His Ala Phe He Phe Phe 

965 5'^^ 
Phe Gly ser Tyr Leu Leu He Gly Lys Asp Thr Ser Leu Leu Gly Asn 

980 985 
Gly Gin Met Phe Gly Asn Trp Thr Phe Gly Thr Leu Val Phe Thr Val 

995 1000 1005 

Met val He Thr Val Thr Val Lys Met Ala Leu Glu Thr His Phe Trp 

Thr Cue Asn His Leu Ja^Thr Trp Gly Ser i^lfne Phe Tyr Phe 
in9S 1030 1035 j;"^" 

ial Phe ser Leu Phe Tyr Gly Gly He Leu^Trp Pro Phe Leu Gly^Ser 

Gin Asn Met Tyr Phe Val Phe He Gin Leu Leu Ser Ser Gly Ser Ala 

1060 1065 1"'" 

Trp Phe Ala He He Leu Met Val Val Thr Cys Leu Phe Leu Asp He 
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1075 1080 10®^ 

lie Lys Lys Val Phe Asp Arg His Leu His Pro Thr Ser Thr Glu Lys 

1090 1095 11"° 

Ala Gin Leu Thr Glu Thr Asn Ala Gly He Lys Cys Leu Asp Ser Met 
nn«^ 1110 1115 

cis Cys Phe pro Glu Gly Glu Ala Ala Cys Ma Ser Val Gly Arg^Met 

Leu Glu Arg Val n^Gly Arg Cys Ser Pro Thr His He Ser Arg Ser 

1140 1145 11^^ 

Trp ser Ala Ser Asp Pro Phe Tyr Thr Asn Asp Arg Ser He Leu Thr 

1155 1160 
Leu Ser Thr Met Asp Ser Ser Thr Cys 
1170 1175 

<210> 3 
<211> 1125 
<212> DNA 

<213> homo sapiens 

a'tgjggcgct ggatccggca gcagctgggt tttgacccac cacatcagag tgacacaaga 60 

acSrctacg Jagccaacag gtttcctcag aatggccttt acacacctca ^aaatttata 120 

aataacagga tcatttcatc taagtacact gtgtggaatt ttgttccaaa a^^tttattt i« 

gfacagtSa gaagagtggc aaacttttat tttcttatta tatttttggt tcagcttatg 240 

attaatacac ctaccagtcc agttaccagt ggacttccat tattctttgt gataacagta Juu 

actgccataa agLgggata tgaagattgg ttacggcata actcagataa tgaagtaaat 360 

ggagctStg tJtaJJttgt tcgaagtggt ggccttgtaa aaactagatc aaaaaacatt 420 

caqqtgggtg atattgttcg aatagccaaa gatgaaattt ttcctgcaga cttggtgctt 480 

SgSSSg atcgactgga tggttcctgt cacgttacaa ctgctagttt ggacggagaa 540 

Saacctga agaLcatgt ggcagttcca gaaacagcat tattacaaac agttgccaat 600 

ttaaacactc tagtagctgt aatagaatgc cagcaaccag aagcagactt atacagattc oo« 

^ ragSra! ^ ^ 
rgfai r-atra: ^^^g ^ ^ ^ 

rH- rrrara =2 =tra ^ 1 

ccftgctttt ttggttctct acaatttcat cattccaatt tcattatatg tgacagtcga 080 
aatgcagaaa tttcttggat cattttttat tggctgggat cttga 

<210> 4 
<211> 374 
<212> PRT 

<213> homo sapiens 

Me^JrJ Arg Trp He Arg Gin Gin Leu Gly Phe Asp Pro Pro His Gin 

ser Asp Thr Arg Thr lie Tyr Val Ala Asn Arg Phe Pro Gin Asn Gly 

•20 25 30 

Leu Tyr Thr Pro Gin Lys Phe He Asp Asn Arg He He Ser Ser Lys 

35 40 45 

Tyr Thr Val Trp Asn Phe Val Pro Lys Asn Leu Phe Glu Gin Phe Arg 

50 55 60 

Arg val Ala Asn Phe Tyr Phe Leu He He Phe Leu Val Gin Leu Met 
65 70 
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lie ASP Thr Pro Thr Ser Pro Val Thr Ser Gly Leu Pro Leu Phe Phe 

85 90 
val lie Thr Val Thr Ala He Lys Gin Gly Tyr Glu Asp Trp Leu Arg 

100 105 
His Asn ser Asp Asn Glu Val Asn Gly Ala Pro Val Tyr Val Val Arg 

115 120 125 

ser Gly Gly Leu Val Lys Thr Arg Ser Lys Asn He Arg Val Gly Asp 

130 135 1*0 , , 

lie val Arg He Ala Lys Asp Glu He Phe Pro Ala Asp Leu Val Leu 
145 150 155 

Leu ser Ser Asp Arg Leu Asp Gly Ser Cys His Val Thr Thr Ala Ser 

165 170 175 

Leu ASP Gly Glu Thr Asn Leu Lys Thr His Val Ala Val Pro Glu Thr 

180 185 1^" 

Ala Leu Leu Gin Thr Val Ala Asn Leu Asp Thr Leu Val Ala Val He 

195 200 205 

Glu Cys Gin Gin Pro Glu Ala Asp Leu Tyr Arg Phe Met Gly Arg Met 

210 215 220 

lie lie Thr Gin Gin Met Glu Glu He Val Arg Pro Leu Gly Pro Glu 
225 230 235 

ser Leu Leu Leu Arg Gly Ala Arg Leu Lys Asn Thr Lys Glu lie Phe 

245 250 255 

Gly val Ala Val Tyr Thr Gly Met Glu Thr Lys Met Ala. Leu Asn Tyr 

260 265 270 

Lys ser Lys Ser Gin Lys Arg Ser Ala Val Glu Lys Ser Met Asn Thr 

275 280 285 

Phe Leu lie He Tyr Leu Val He Leu He Ser Glu Ala Val He Ser 

290 295 300 

Thr He Leu Lys Tyr Thr Trp Gin Ala Glu Glu Lys Trp Asp Glu Pro 
305 

Trp Tyr Asn Gin Lys Thr Glu His Gin Arg Asn Ser Asn Ser Glu He 

325 330 
Tyr Phe Arg Leu Pro Cys Phe Phe Gly Ser Leu Gin Phe His His Ser 

340 345 350 

Asn Phe He He Cys Asp Ser Arg Asn Ala Glu He Ser Trp He He 

355 360 365 

Phe Tyr Trp Leu Gly Ser 
370 

<210> 5 

<211> 7277 

<212> DNA 

<213> hOTK> sapiens 

gccgcgggat gggaacgcgg cgcggggagt gaggcagtgg cggcggcggc ^gtaagcgga 60 
acttcggccc gaggggctcg cccgctcccg cctctgtctt gtcggcctcc acctgcagcc 120 
ccgcggcccc cgcgccccgc gggacccgga cggcgacgac gggggaatgt J^cgctggat 180 
ccggcagcag ctgggttttg acccaccaca tcagagtgac acaagaacca tctacgtagc 240 
caacaggtt? ccJcagaatg gcctttacac acctcagaaa tttatagata acaggatcat 300 
"catctaag tacactgtgt ggaattttgt tccaaaaaat ttatttgaac agttcagaag 360 
agtggcaaac ttttattttc ttattatatt tttggttcag cttatgattg atacacctac 
cag?ccagtt accagtggac ttccattatt ctttgtgata acagtaactg ccataaagca 
gggatatgaa gattggttac ggcataactc agataatgaa gtaaatggag ctcctgttta 
tgttgttcga agtggtggcc ttgtaaaaac tagatcaaaa aacattcggg tgggtgatat 600 
Sttcgaa?a gccaaagatg aaatttttcc tgcagacttg gtgcttctgt cctcagatcg 660 
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actggatggt 
acatgtggca 
agctgtaata 
cataacccaa 



tggagccaga 
aactaagatg 
aatgaataca 
tatcttgaag 
aacagaacat 
tttggttctc 
atttcttgga 
gaaagctcaa 
tacagataaa 
tggcatgaaa 
ttcagaagga 
tacaaccagt 
tgatctcttc 
tgactgcact 
tgcatcttca 
tattggcaat 
actgcttcat 
accttcaggt 
tataggtgga 
aagaactctg 
acgcatattt 
ccagttcata 
agataaagtt 
tactggggat 
aaccatgaac 
gaggcagctt 
tgggaccagc 
aaattgttca 
actaataaaa 
cgtaagcatg 
ggctgcaaga 
tgttcatggt 
gaatgtgtgc 
aacattgtat 
tctgatatat 
cctttatcga 
catcctgggc 
agatacatct 
cttcacagtc 
ttggatcaac 
ttatggaggg 
gctcctgtca 
tcttgatatc 
acagcttact 
aggagaagca 
tccaacccac 
catcttgact 
ggagccagtt 
gctctgaaat 
caaacaaaca 
tgttaaaatt 
ttatgggact 



tcctgtcacg 
gttccagaaa 
gaatgccagc 
caaatggaag 
ttaaaaaaca 
gcattaaatt 
tttttgataa 
tatacatggc 
caaagaaata 
tacaatttca 
tcatttttta 
gtcaatactt 
actggtacac 
taccaagaaa 
aacttatctt 
tcctctttca 
tttaaagcag 
ggtgatggtc 
ccagatgaaa 
tctgaagaaa 
attctggaat 
gagaagttat 
gaaatagaaa 
tgtatagcat 
gaagccagga 
gagaaagacc 
cgagaaacta 
aaacatgaaa 
atccttgaac 
gccagaagaa 
ctatctcttg 
gctgtattat 
atatcacctg 
atacaagaag 
aacagtgact 
catttttatt 
tttatcacac 
gacagcgtgt 
agtcttttgg 
gacattagta 
ttcagtcatg 
ctgcttggaa 
atggttatta 
catctcgtta 
attctctggc 
agtggttctg 
ataaagaagg 
gaaacaaatg 
gcgtgtgcat 
atcagcagat 
ctctccacaa 
cacctccttt 
taatttccaa 
gaaagcatta 
tgagaataaa 
cctaatggca 



ttacaactgc 
cagcattatt 
aaccagaagc 
aaattgtaag 
caaaagaaat 
acaagagcaa 
tttatctagt 
aagctgaaga 
gcagtaagat 
tcattccaat 
ttggctggga 
ccgatctgaa 
tgacagaaaa 
ttaatggtag 
atcttagtag 
gaaccagtcc 
tcagtctctg 
cctggcaatc 
aggctctagt 
ctatggaggt 
ttgattcaga 
tatttgctaa 
aaaccagaat 
atagaaaatt 
ctgccttgca 
tgatattact 
ttgaagcatt 
cagctgttag 
ttataaacca 



ttacagagga 
cactcaggga 
gctgtcgtat 
agaaacctat 
cccatgttgg 
atgcaatagc 
atattagaat 
cccagttttt 
acctgacttt 
aacagcatgt 
aaaaccgcct 
cctttatttt 
atggccagat 
cagtcacagt 
cctggggatc 
catttttggg 
cttggtttgc 
tctttgaccg 
caggtatcaa 
ctgttggaag 
catggagtgc 
tggactcatc 
cctaaaattc 
aatctttgta 
gtacaagccc 
gagacatttt 
tttcagtctg 



tagtttggac 
acaaacagtt 
agacttatac 
acctctgggg 
ttttggtgtt 
atcacagaaa 
aattcttata 
aaaatgggat 
tctgagattt 
ttcattatat 
tcttgatctg 
tgaagagctt 
tgagatgcag 
acttgtaccc 
tttatcccat 
tgaaaatgaa 
tcacactgta 
caacctggca 
agaagctgct 
taaaactctt 
tcgtaggaga 
aggagctgag 
tcatgtagat 
tacatcaaaa 
gcagcgggaa 
tggagccaca 
gagaatggct 
tgtgagttta 
gaaatcagac 
tcatgtgatt 
gcatgaaaaa 
ggctccactg 
aacattggct 
cataggaatc 
cagatttaag 
agctaccctt 
atatcagttc 
atacaatatt 
agaccctcat 
cttaagtatt 
cttttttgga 
gttyggaaac 
aaagatggct 
tattatattt 
ctcccagaat 
cataatcctc 
acacctccac 
gtgcttggac 
aatgctggaa 
atcggatcct 
tacttgttaa 
agtgtgatca 
gtagttcata 
ctcccaacac 
tcatctcttt 
ttgctgaggc 



ggagaaacta 
gccaatttgg 
agattcatgg 
ccggagagtc 
gcggtataca 
cgatctgcag 
tctgaagctg 
gaaccttggt 
atttcagact 
gtgacagtcg 
tatcatgaag 
ggacaggtag 
tttcgggaat 
gaaggaccaa 
cttaacaact 
actgaactaa 
cagattagca 
ccatcgcagt 
gcaaggattg 
ggaaaactgg 
atgagtgtaa 
tcatcaattc 
gaatttgctt 
gagtatgagg 
gagaaattgg 
gcagtagaag 
ggtatcaaag 
tcatgtggcc 
agcgagtgtg 
cagcatgggc 
ctatttatgg 



acctgaagac 
acactctagt 
gacgaatgat 
tcctgcttcg 
ctggaatgga 
tagaaaagtc 
tcatcagcac 
ataaccaaaa 
tccttgcttt 
aaatgcagaa 
aatcagatca 
agtacgtgtt 
gttcaattaa 
caccagactc 
tatcccatct 



cagaaagcaa 
gttggtgatg 
atgggtaaag 
ttcctctcca 
gtacagtatt 
tactgtttgt 
tgttttactt 
gtgttacaaa 
aaaacatttc 
tcctatttac 
tggacatttg 
ctggaaactc 
tattttgtat 
atgtattttg 
atggttgtta 
cctacaagta 
tccatgtgct 
cgagttatag 
ttctatacca 
aggggcagta 
ccctgttaat 
cccactcaga 
ccttaatttg 
gtctggtttg 
cattatattt 



ttaaagaaca 
atgttcaaac 
tggagtacta 
gtattgtgtt 
aacggtacaa 
ttgttcaggc 
tccctaaatg 
tgaaagggct 
aaatagataa 
cagctgtttt 
acagactaca 
tatgggtact 
attttcatag 
ctgaacaatt 
tggtagtgga 
aagtttgcag 
aagtaataag 
gtgctaatga 
aaggaagaca 
aattgctttt 
ttttttataa 
tttctcagca 
ccctacctat 



ataagcccac 

tttattggac 

taatagggaa 

gcactttggt 

atttttggac 

tttccttgtt 

tgtttattca 

catgtctatt 

ctgaaaaggc 

gtttcccgga 

gaagatgtag 

acgacaggag 

gtactttgtg 

ggccacacta 

gttataatgg 

aatctgaaca 

tcccttgtgc 

taatataaat 



720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
27 60 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
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gtagaaaaaa 
tcttctattt 
ttttattaga 
tttggtccct 
attaggagaa 
tcctctaact 
atatatacaa 
ttttgtgaca 
aatttaatgt 
ttgctttttt 
gtttaactat 
tacaaatcag 
ccgacttaga 
aataatctta 
ttctgccagt 
caacttgtat 
tgttactgta 
agttcagtat 
agtgattatt 
taagataatg 
gagatttatt 
tggactggca 
taaactatta 
gttattaagc 
ctggatttag 
tctgcattca 
ctaggtggga 
aatgacaccc 
actgagtact 



gcaagacagg 
g tact caeca 
accactagtg 
tatcttgtga 
caagctttca 
gtaatcacaa 
tgatgttgat 
attgctaatc 
cctgaagata 
gactcagcac 
gggccaggga 
atggtctgaa 
ttccagacct 
aaccctagga 
ctggaaaaga 
caaggccact 
aaatctcgtg 
gatcagtttg 
taaaacatgt 
gagttatcta 
tgaatgttgt 
ttagattttt 
tggacctgag 
aaatgtagta 
atcaaataaa 
aaaaaaaaaa 



gagagaaatc 
aaatctgctt 
gttatattta 
cagtgacctg 
tcatttccaa 
gtactgtaac 
atcaggaatc 
gagtataaag 
ctttatcatt 
tctctttacc 
atcaataaaa 
aatagtatgg 
catgctcttc 
ttttcaggtt 
gtcccagtac 
caggtaatgt 
attaactcat 
tgtcatttgt 
agcttgagaa 
gttaaatgcg 
gtgcagccct 
ggggaaagaa 
ttttcttgct 
taaatattaa 
gaagatctgt 
ctaattcatg 
ggatatggaa 
agtaggcctg 
gacaccagac 
aggtcagctc 
tttgaagtta 
tgctcatcct 
ttttaagaaa 
gtagttttct 
ggaaagtgtc 
aattaactta 
agtggtctca 
gtggcaagca 
cctgcctcag 
gaactacagc 
ttttctaact 
gagccacacc 
aatacttgca 
ttgtgagatt 
agaaaattga 
aaggctgtag 
gggtatgata 
tttattaatt 
ttttcataaa 
tcctataaaa 
atgaggaatg 
acactgtggc 
tattattgta 
aactttccta 
aaaaaaa 



ttagtaaaga 

ctgtaaatta 
aagcttttca 
tgttgttaat 
ccattattta 
acagcctgta 
aggtccgttc 
accctatagt 
ggatcttttg 
tgaaggctct 
agtttggaca 
gtaattaaat 
cctttctata 
atgtcatcta 
aaggcatatt 
tagcaataaa 
tgcacttcaa 
ttttgtttta 
ctattaccca 
gttttaccaa 
aagcttcctt 
tggtagagac 
aatgtaacat 
ttttcaaaaa 
tattctggaa 
ttccagaaga 
atttgctcat 
cattacattt 
aaagactcca 
gcctataatg 
gttaaggaga 
gaactgttac 
agagttttct 
agtgctatat 
ccctttgcat 
cccttatctg 
aatcgatttg 
ccaagtcagt 
cttcagcagg 
tacgaagacc 
gtcctctttc 
caggccctat 
ttctgcccta 
gaacttatct 
cagttaagag 
gaaaagggag 
taagcaggta 
ttggtcccca 
agtaaaacac 
aatagatcat 
agtatctgga 
tgtctaatgt 
cagctactca 
tataattaaa 



gtatttttta 
tgctgaaagt 
tgggaaaagt 
tcattaatgc 
ctgcagtatg 
aagttagcca 
accgaacttc 
gggtaaatta 
catgctttaa 
gtgtatagta 
gtatttaaat 
gaatacaaaa 
agctagattt 
acttatagca 
tcaggtgtgg 
ttaaatgcta 
aacctaactt 



ttgaaaagta 
gctctaagca 
gttttccctt 
cccatttcat 
agaaattaag 
ttgtctgttc 
tagtccttct 
gtactaaaaa 
ggaaataatg 
aaaatctctt 
acatgaccgt 
aagtcataaa 
gtgcttaaag 
actttatttt 
tccaaatcca 
atttatttaa 
tcatcctgta 
atttctttaa 
ccaaaaccag 
cctccctttg 
ttccaaaatt 
cstaggctca 
tgctgtcgag 
ttgggtctaa 
cctgaacagg 
cggttagtac 
gatcgcttga 
ccaaaagttt 
aatcttccat 
ttaataaaaa 
cgtacagaca 
tattaaagtg 
aactcatgat 
aatattgtag 
aatcctttaa 
taatttttta 
aaaaaaaaaa 



gtattagctt 
ttgccttgag 
taatgtgaat 
attctgagtt 
gggagtaaat 
tataaatgca 
aaattgatgt 
gatactatta 
tctggttaac 
tttcatgaca 
attgcaaata 
agaagagcct 
tagaataaag 
aactaccaca 
ctgtggaatg 
agaatgatta 
ccatcctgaa 
atgttgtctt 
aataatgatt 
gaaaatgtaa 
gaatataagg 
actttatcct 
cagtgatgta 
ttaacttaga 
gaataataca 
aagatatact 
ataaaacgtg 
gtttatttgc 
atagcctatg 
tgtgattgat 
tttaaaaaaa 
ctccgttttt 
gaaagtaaca 
aaactcttac 
aattctttct 
agcaaaatgc 
cctcgtctga 
gcccctcagc 
ccctgggcgg 
ttgagaaaag 
agctcataat 
agactaaaca 
caggactgag 
gactcctaat 
ttaaaatatg 
gttggtgttt 
taacacacca 
ttttatttct 
ctgttttatg 
atgtttgtaa 
caatacttgg 
aaattctctg 
aagtttatga 
aaaaaaaaaa 



gattattgac 
aactctattt 
actgaggaat 
cacagagcaa 
ttataccaat 
agggtatatc 
ttactaatat 
gcatattatt 
atatttaaat 
tcgttgtaca 
tgtttaatta 
ctttctgcag 
ggtttcagtt 
atacagtgag 
taaaaatgct 
atcgggtaca 
tttatcaagt 
aagatttaga 
gtatacatat 
ttcctttatg 
cttctagaat 
tgtttgcttg 
aggatattaa 
tatttcatag 
acgtacaatg 
cagtagagta 
catataacaa 
catcaaataa 
accaactgca 
gtaattttct 
gtaaatggca 
aaagcaaaat 
atgcagtctg 
tacgtaacca 
ttggaaagta 
taaatacgtt 
gggctgtaag 
tgctttaagt 
agcaaagtat 
gggagaattt 
acacaaaggc 
gaggcaaatc 
gtcatttcta 
aggcaggagt 
ctactctgaa 
ttcctgtaaa 
aagagttacg 
attttgaaat 
tgaaataact 
tcatggtaat 
tttaaaattt 
cattgtcagt 
agttatattt 
aaaaaacaaa 



4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

54 60 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7277 
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<210> 6 

<2ll> 2913 

<212> DNA 

<213> homo sapiens 



aJSqcacaac tagagaggag cgccatctct ggcttcagct ctaagtccag gcgaaactca 60 

Sg'tatg atgttaajcg tgaagtatac aatgaggaga cctttcaaca 9^acacaaa 120 

aoaaaaacct cctcttctgg gaacatgaac atcaacatca ccaccttcag acaccacgtc 180 

ca^gccgS gctcatggca caggttccta cgatgcrtgc ttacaatctt tcccttccta 240 

gaSgga?gt gtatgtatcg attaaaggat tggcttctgg gagacttact tgctggtata 300 

IrtSJqgcc ttgtgcaagt tccccaaggc ctgacactta gtttgctggc aaggcaactg 360 

SScctc Saacatcgc ttatgcagct ttctgttctt cggtaatcta tgtaattttt 420 

ggltcgtgtc atcaaatgtc cgttggttcc ttcttcctgg tgagtgctct gctgatcaac 480 

ggatcgtgui- ^ ^,,^^„„4.«, toaaatcttt cgtcaagaat 540 



"^^IIZ - a- ra;raa;;5 «actggt^ taggatcttt cgtcaagaat 540 

ga^Stcgg JcLtccta ccttatgggc tataataaat ccttgagtgt ^tggcaacc 600 

aSacttttc tgactgggat tattcagcta ataatgggcg tattgggttt gggcttcatt 660 

g^ctfac: tJccggagtc tgcaatgaat gcttacctgg ctgctgtggc acttcatatc 720 

atgctgtccc agctgacttt catctttggg attatgatta gtttccatgc cggtcccatc 780 

tcScttct a?gaLtaat taattactgt gtagctctcc caaaagcgaa "ccaccagc 84 

attctagtat ttctaactgt tgttgttgct ctgcgaatca acaaatgtat ^agaatttct 900 

Jtcaatcagt atcccattga gtttcccatg gaattatttc tgattattgg cttcactgtg 960 

attgcaaaca agataagcat ggccacagaa accagccaga cgcttattga ^atgattcct 1020 

.atUttc tgcttcctgt aacaccagat ttcagccttc ttcccaagat a ttt-- 1080 



gcctJctcct tatctttggt gagotccttt ctgctcatat ttctgggcaa gaagattgcc 1140 

agtcttcaca attacagtgt caattccaac caggatttaa tagccatcgg ^ctttgcaat 1200 

gSgtcagtt catttttcag atcttgtgtg tttactggtg ctattgctag gactattatc 1260 

caggataaat ctggaggaag acaacagttt gcatctctgg taggcgcagg tgtgatgctg 1320 

ctcctgatgg tgaagatggg acactttttc tacacactgc caaatgctgt gctggctggt 1380 

att2?ctS gcaacgtSt tccctacctt gaaaccattt ctaacctacc cagcctgtgg 1440 

^ggcaggacc aatatjctg tgctctttgg atgatgacat tctcatcttc aattttcctg 1500 

ggactggaca ttggactaat tatctcagta gtttctgctt tcttcatcac cactgttcgt 1560 

S«Sgag ctaagattct tctcctgggt caaatcccta acaccaacat "atagaagc 1620 

atcaatgatt atcgggagat catcaccatt cctggggtga aaatcttcca gtgctgcagc 1680 

tcaaSafat ttgtaaa^gt ttactaccta aagcataagc tgttaaaaga ggttgatatg 1740 

gtaaaggtgc ctcttaaaga agaagaaatt ttcagcttgt ttaattcaag tgacaccaat 1800 

ctacaaggag gaaagatttg caggtgtttc tgcaactgtg atgatctgga gccgctgccc I860 

aggattSt? acacagagcg atttgaaaat aaactggatc ccgaagcatc '^tccattaac. 1920 

Sgattcact gctcacattt tgagagcatg, aacacaagcc aaactgcatc cgaagaccaa 1980 

g^gccataca cagtatcgtc cgtgtctcag aaaaatcaag ggcaacagta tgaggaggtg 2040 

gaggaagttt ggcttcctaa taactcatca agaaacagct caccaggact gcctgatgtg 2100 

gcggaaagcc aggggaggag atcactcatc ccttactcag atgcgtctct actgcccagt 2160 

^cLcacca tcatcctgga tttctccatg gtacactacg tggattcacg ggg^^tagtc 2220 

Sattaagac agatatgcaa tgcctttcaa aacgccaaca ttttgatact cattgcaggg 2280 

JgtcactJtt ccatagicag ggcatttgag aggaatgatt tctttgacgc tggcatcacc 2340 

aagacccagc tgttcctcag cgttcacgac gccgtgctgt ttgccttgtc aaggaaggtc 2400 

aSggctcct c^gagttaag catcgatgaa tccgagacag tgatacggga aacctactca 2460 

gaaacagaca agaatgacaa ttcaagatat aaaatgagca gcagttttct aggaagccaa 2520 

aaaaatgtaa gtccaggctt catcaagatc caacagcctg tagaagagga Stcggagttg 2580 

gatttggagc Jggaatcaga acaagaggct gggctgggtc tggacctaga -tg^atcgg 2640 

gagctggagc ctgaaatgga gcccaaggct gagaccgaga ccaagaccca gaccgagatg 2700 

gagccccagc ctgagactga gcctgagatg gagcccaacc ccaaatctag jccaagagct 2760 

cacacttttc ctcagcagcg ttactggcct atgtatcatc cgtctatggc ttccacccag 2820 

tctcagactc agact.cggac atggtcagtg gagaggagac gccatcctat ggattcatac 2850 
tcaccagagg gcaacagcaa tgaagatgtc tag 



2913 
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<210> 7 
<211> 970 
<212> PRT 

<213> homo sapiens 
<220> 

<221> VARIANT 

<222> (1) . (970) 

<223> Xaa = Any Amino Acid 



Mer^ll Gin Leu Glu Arg Ser Ala lie Ser Gly Phe Ser Ser Lys Ser 

Arg Arg Asn Ser Phe Ala Tyr Asp Val Lys Arg Glu Val Tyr Asn Glu 

20 25 30 

Glu Thr Phe Gin Gin Glu His Lys Arg Lys Ala Ser Ser Ser Gly Asn 

35 40 45 

Met Asn lie Asn He Thr Thr Phe Arg His His Val Gin Cys Arg Cys 

50 55 60 

ser Trp His Arg Phe Leu Arg Cys Met Leu Thr He Phe Pro Phe Leu 
65 70 "^5 o" 

Glu Trp Met Cys Met Tyr Arg Leu Lys Asp Trp Leu Leu Gly Asp Leu 

85 90 
Leu Ala Gly He Ser Val Gly Leu Val Gin Val Pro Gin Gly Leu Thr 

100 

Leu ser Leu Leu Ala Arg Gin Leu He Pro Pro Leu Asn He Ala Tyr 

115 120 
Ala Ala Phe Cys Ser Ser Val He Tyr Val He Phe Gly Ser Cys His 

130 135 "0 

Gin Met Ser Val Gly Ser Phe Phe Leu Val Ser Ala Leu Leu He Asn 

ill Leu Lys val Ser Pro Phe Asn Asn Gly Gin Leu Val Met Gly Ser 

165 I'^O 1'5 

Phe val Lys Asn Glu Phe Ser Ala Pro Ser Tyr Leu Met Gly Tyr Asn 

180 185 l^'' 

Lys Ser Leu Ser Val Val Ala Thr Thr Thr Phe Leu Thr Gly He He 

195 200 205 

Gin Leu He Met Gly Val Leu Gly Leu Gly Phe He Ala Thr Tyr Leu 

210 215 220 

Pro Glu ser Ala Met Asn Ala Tyr Leu Ala Ala Val Ala Leu. His He 
?25 230 235 

Met Leu ser Gin Leu Thr Phe He Phe Gly He Met He Ser Phe His 

245 250 255 

Ala Gly pro He Ser Phe Phe Tyr Asp He He Asn Tyr Cys Val Ala 

260 265 270 

Leu Pro Lys Ala Asn Ser Thr Ser He Leu Val Phe Leu Thr Val Val 

275 280 285 

val Ala Leu Arg He Asn Lys Cys He Arg He Ser Phe Asn Gin Tyr 

290 295 300 

pro He Glu Phe Pro Met Glu Leu Phe Leu He He Gly Phe Thr Val 
•^AR 310 

He Ala Asn Lys He Ser Met Ala Thr Glu Thr Ser Gin Thr Leu He 

325 330 335 

ASP Met He Pro Tyr Ser Phe Leu Leu Pro Val Thr Pro Asp Phe Ser 

340 345 350 

Leu Leu Pro Lys He He Leu Gin Ala Phe Ser Leu Ser Leu Val Ser 
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355 360 365 

Ser Phe Leu Leu lie Phe Leu Gly Lys Lys lie Ala Ser Leu His Asn 

370 375 380 

Tyr ser Val Asn Ser Asn Gin Asp Leu He Ala He Gly Leu Cys Asn 
385 390 395 400 

val val ser Ser Phe Phe Arg Ser Cys Val Phe Thr Gly Ala lie Ala 

405 410 415 

Arg Thr He He Gin Asp Lys Ser Gly Gly Arg Gin Gin Phe Ala Ser 

420 425 430 

Leu val Gly Ala Gly Val Met Leu Leu Leu Met Val Lys Met Gly Hxs 

435 440 445 

Phe Phe Tyr Thr Leu Pro Asn Ala Val Leu Ala Gly He He Leu Ser 

450 455 460 

Asn Val He Pro Tyr Leu Glu Thr He Ser Asn Leu Pro Ser Leu Trp 
465 470 475 

Arg Gin Asp Gin Tyr Asp Cys Ala Leu Trp Met Met Thr Phe Ser Ser 

ser He Phe Le«4ly Leu Asp He Gly Leu He He Ser Val Val Ser 

500 505 510 

Ala Phe Phe He Thr Thr Val Arg Ser His Arg Ala Lys He Leu Leu 

515 520 525 

Leu Gly Gin He Pro Asn Thr Asn He Tyr Arg Ser He Asn Asp Tyr 

530 535 540 , 

Arg Glu He He Thr He Pro Gly Val Lys He Phe Gin Cys Cys Ser 
545 550 555 560 

Ser He Thr Phe Val Asn Val Tyr Tyr Leu Lys His Lys Leu Leu Lys 

565 570 575 

Glu val Asp Met Val Lys Val Pro Leu Lys Glu Glu Glu He Phe Ser 

580 585 590 

Leu Phe Asn Ser Ser Asp Thr Asn Leu Gin Gly Gly Lys He Cys Arg 

595 600 605 

Cys Phe Cys Asn Cys Asp Asp Leu Glu Pro Leu Pro Arg He Leu Tyr 

610 615 620 

Thr Glu Arg Phe Glu Asn Lys Leu Asp Pro Glu Ala Ser Ser He Asn 

635 640 



625 630 



Leu He His Cys Ser His Phe Glu Ser Met Asn Thr Ser Gin Thr Ala 

645 650 655 

ser Glu Asp Gin Val Pro Tyr Thr Val Ser Ser Val Ser Gin Lys Asn 

660 665 670 

Gin Gly Gin Gin Tyr Glu Glu Val Glu Glu Val Trp Leu Pro Asn Asn 

675 680 685 

Ser ser Arg Asn Ser Ser Pro Gly Leu Pro Asp Val Ala Glu Ser Gin 

690 695 700 

Gly Arg Arg Ser Leu He Pro Tyr Ser Asp Ala Ser Leu Leu Pro Ser. 
705 

val His Thr He He Leu Asp Phe Ser Met Val His Tyr Val Asp Ser 

725 730 735 

Ara Gly Leu Val Val Leu Arg Gin He Cys Asn Ala Phe Gin Asn Ala 

740 745 750 

Asn He Leu He Leu He Ala Gly Cys His Ser Ser He Val Arg Ala 

755 760 765 

Phe Glu Arg Asn Asp Phe Phe Asp Ala Gly He Thr Lys Thr Gin Leu 

770 775 780 

Phe Leu Ser Val His Asp Ala Val Leu Phe Ala Leu Ser Arg Lys Val 
785 790 795 800 

He Gly Ser Ser Glu Leu Ser He Asp Glu Ser Glu Thr Val He Arg 
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805 810 815 

Glu Thr Tyr Ser Glu Thr Asp Lys Asn Asp Asn Ser Arg Tyr Lys Met 

820 825 830 

Ser Ser Ser Phe Leu Gly Ser Gin Lys Asn Val Ser Pro Gly Phe He 

835 840 845 

Lys He Gin Gin Pro Val Glu Glu Glu Ser Glu Leu Asp Leu Glu Leu 

850 855 860 

Glu Ser Glu Gin Glu Ala Gly Leu Gly Leu Asp Leu Asp Leu Asp Arg 
865 870 875 880 

Glu Leu Glu Pro Glu Met Glu Pro Lys Ala Glu Thr Glu Thr Lys Thr 

885 890 
Gin Thr Glu Met Glu Pro Gin Pro Glu Thr Glu Pro Glu Met Glu Pro 

900 905 910 

Asn Pro Lys Ser Arg Pro Arg Ala His Thr Phe Pro Gin Gin Arg Tyr 

915 920 925 

Trp Pro Met Tyr His Pro Ser Met Ala Ser Thr Gin Ser Gin Thr Gin 

930 935 940 

Thr Arg Thr Trp Ser Val Glu Arg Arg Arg His Pro Met Asp Ser Tyr 
945 950 955 960 

Ser Pro Glu Gly Asn Ser Asn Glu Asp Val 
965 970 

<210> 8 
<2il> 3749 
<212> DNA 

<213> homo sapiens 
<400> 8 

ttttccaact ccccatctcc tccctcctca gattaaaaga agttatatgg actttgtgat 60 
gttttctgcc gctttgtgaa gtaggcctta tttctcttgt cctttcgtac agggaggaat 120 
ttgaagtaga tagaaaccga cctggattac tccggtctga actcagatca cgtaggactt 
taatcgttga acaaacgaac ctttaatagc ggctgcacca tcgggatgtc ctgatccaac 
atcgaggtcg taaaccctat tgttgatatg gactctagaa taggattgcg ctgttatccc 
tagggtaact tgttccgttg gtcaagttat tggatcaatt gagtatagta gttcgctttg 360 
actggtgaag tcttggcatg tactgctcgg aggttgggtt ctgctccgag gtcgccccaa 
ccgaaatttt taatgcagga gcgcccgcac tcccgccccc gccaaggagc caggaatggc 
acaactagag aggagcgcca tctctggctt cagctctaag tccaggcgaa actcattcgc 
atatgatgtt aagcgtgaag tatacaatga ggagaccttt caacaggaac acaaaaggaa 
ggcctcctct tctgggaaca tgaacatcaa catcaccacc ttcagacacc acgtccagtg 
ccgctgctca tggcacaggt tcctacgatg crtgcttaca atctttccct tcctagaatg 
gatgtgtatg tatcgattaa aggattggct tctgggagac ttacttgctg gtataagtgt 
tggccttgtg caagttcccc aaggcctgac acttagtttg ctggcaaggc aactgattcc 840 
tcctctcaac atcgcttatg cagctttctg ttcttcggta atctatgtaa tttttggatc 900 
gtgtcatcaa atgtccgttg gttccttctt cctggtgagt gctctgctga tcaacgttct 960 
gaaagtgagc ccattcaaca acggtcaact ggtcatggga tctttcgtca agaatgagtt 1020 
ttcggccccc tcctacctta tgggctataa taaatccttg agtgtggtgg caaccacaac lORQ 
ttttctgact gggattattc agctaataat gggcgtattg ggtttgggct tcattgccac 
ttaccttccg gagtctgcaa tgaatgctta cctggctgct gtggcacttc atatcatgct 
gtcccagctg actttcatct ttgggattat gattagtttc catgccggtc ccatctcctt 1260 
cttctatgac ataattaatt actgtgtagc tctcccaaaa gcgaattcca ccagcattct 1320 
agtatttcta actgttgttg ttgctctgcg aatcaacaaa tgtatcagaa tttctttcaa 1380 
tcagtatccc attgagtttc ccatggaatt atttctgatt attggcttca ctgtgattgc 1440 
aaacaagata agcatggcca cagaaaccag ccagacgctt attgacatga ttccttatag 1500 
ctttctgctt cctgtaacac cagatttcag ccttcttccc aagataattt tacaagcctt 1560 
ctccttatct ttggtgagct cctttctgct catatttctg ggcaagaaga ttgccagtct 1620 
tcacaattac agtgtcaatt ccaaccagga tttaatagcc atcggccttt gcaatgtcgt 1680 
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<210> 9 
<211> 1524 
<212> DNA 

<213> homo sapiens 



<400> 9 ^^4. 
atggggtgtt ggggtcggaa ccggggccgg ctgctgtgca tgctggcgct gaccttcatg 
ttcatggtgc tggaggtggt ggtgagccgg gtgacctcgt cgctggcgat gctctccgac 
tccttccaca tgctgtcgga cgtgctggcg ctggtggtgg cgctggtggc cgagcgcttc 
gcccggcgga cccacgccac ccagaagaac acgttcggct ggatccgagc cgaggtaatg 
ggggctctgg tgaacgccat cttcctgact ggcctctgtt tcgccatcct gctggaggcc 
atcgagcgct tcatcgagcc gcacgagatg cagcagccgc tggtggtcct tggggtcggc 
gtggccgggc tgctggtcaa cgtgctgggg ctctgcctct tccaccatca cagcggcttc 
agccaggact ccggccacgg ccactcgcac gggggtcacg gccacggcca cggcctcccc 
aaggggcctc gcgttaagag cacccgcccc gggagcagcg acatcaacgt ggccccgggc 
gagcagggtc ccgaccagga ggagaccaac accctggtgg ccaataccag caactccaac 
gggctgaaat tggaccccgc agacccagaa aaccccagaa gtggtgatac agtggaagta 
caagtgaatg gaaatcttgt cagagaacct gaccatatgg aactggaaga agatagggct 
ggacaactta acatgcgtgg agtttttctg catgtccttg gagatgcctt gggttcagtg 
attgtagtag taaatgcctt agtcttttac ttttcttgga aaggttgttc tgaaggggat 



1740 
1800 
1860 
1920 



cagttcattt ttcagatctt gtgtgtttac tggtgctatt gctaggacta ttatccagga 
taaatctgga ggaagacaac agtttgcatc tctggtaggc gcaggtgtga tgctgctcct 
gatggtgaag atgggacact ttttctacac actgccaaat gctgtgCtgg ctggtattat 
tctgagcaac gtcattccct accttgaaac catttctaac ctacccagcc tgtggaggca 1920 
ggaccaatat gactgtgctc tttggatgat gacattctca tcttcaattt tcctgggact 1980 
ggacattgga ctaattatct cagtagtttc tgctttcttc atcaccactg ttcgttcaca 2040 
cagagctaag attcttctcc tgggtcaaat ccctaacacc aacatttata gaagcatcaa 2100 
tgattatcgg gagatcatca ccattcctgg ggtgaaaatc ttccagtgct gcagctcaat 2160 
tacatttgta aatgtttact acctaaagca taagctgtta aaagaggttg atatggtaaa 2220 
ggtgcctctt aaagaagaag aaattttcag cttgtttaat tcaagtgaca ccaatctaca 2280 
aggaggaaag atttgcaggt gtttctgcaa ctgtgatgat ctggagccgc tgcccaggat 2340 
tctttacaca gagcgatttg aaaataaact ggatcccgaa gcatcctcca ttaacctgat 24O0 
tcactgctca cattttgaga gcatgaacac aagccaaact gcatccgaag accaagtgcc 2460 
atacacagta tcgtccgtgt ctcagaaaaa tcaagggcaa cagtatgagg aggtggagga 
agtttggctt cctaataact catcaagaaa cagctcacca ggactgcctg atgtggcgga 
aagccagggg aggagatcac tcatccctta ctcagatgcg tctctactgc ccagtgtcca 
caccatcatc ctggatttct ccatggtaca ctacgtggat tcacgggggt tagtcgtatt 2700 
aagacagata tgcaatgcct ttcaaaacgc caacattttg atactcattg cagggtgtca 2760 
ctcttccata gtcagggcat ttgagaggaa tgatttcttt gacgctggca tcaccaagac 2820 
ccagctgttc ctcagcgttc acgacgccgt gctgtttgcc ttgtcaagga aggtcatagg 
ctcctctgag ttaagcatcg atgaatccga gacagtgata cgggaaacct actcagaaac 
agacaagaat gacaattcaa gatataaaat gagcagcagt tttctaggaa gccaaaaaaa 
tgtaagtcca ggcttcatca agatccaaca gcctgtagaa gaggagtcgg agttggattt 
ggagctggaa tcagaacaag aggctgggct gggtctggac ctagacctgg atcgggagct 
ggagcctgaa atggagccca aggctgagac cgagaccaag acccagaccg agatggagcc 
ccagcctgag actgagcctg agatggagcc caaccccaaa tctaggccaa gagctcacac 
ttttcctcag cagcgttact ggcctatgta tcatccgtct atggcttcca cccagtctca 
gactcagact cggacatggt cagtggagag gagacgccat cctatggatt catactcacc 
agagggcaac agcaatgaag atgtctagga gatgaactag aaataagggg tcagataatg 
ctggcaaatc ctcctaccca aaaaggggtc aattgtccag agacctagac tggatacgaa 
ctagcagtac ttccttcctg actgtgactc ctactacctg ccagccttct tccttgctct 
gcgctgggat catactccca aatcacatta ctaaatgcca acaattatct ctgaattccc 
tatccaggct cccctcattt caccttcagc atatattcta gtcatgaatt tccttcttca 
cacaccccac atctctgggc tttgtgccag accatctcta acttaatcct ctcatccctg 
ttcccctttc tccaaagaga tgaagctca 
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ttttgtgtga atccatgttt ccctgacccc tgcaaagcat ttgtagaaat aattaatagt 
actcatgcat cactttatga ggctggtcct tgctgggtgc tatatttaga tccaactctt 
tgtgttgtaa tggtttgtat acttctttac acaacctatc cattacttaa ggaatctgct 
cttattcttc tacaaactgt tcctaaacaa attgatatca gaaatttgat aaaagaactt 
cgaaatgttg aaggagttga ggaagttcat gaattacatg tttggcaact tgctggaagc 
agaatcattg ccactgctca cataaaatgt gaagatccaa catcatacat ggaggtggct 
aaaaccatta aagacgtttt tcataatcac ggaattcacg ctactaccat tcagcctgaa 
tttgctagtg taggctctaa atcaagtgta gttccgtgtg aacttgcctg cagaacccag 
tgtgctttga agcaatgttg tgggacacta ccacaagccc cttatggaaa ggatgcagaa 
aagaccccag cagttagcat ttcttgttta gaacttagta acaatctaga gaagaagccc 
aggaggacta aagctgaaaa catccctgct gttgtgatag agattaaaaa catgccaaac 
aaacaacctg aatcatcttt gtga 

<210> 10 
<211> 507 
<212> PRT 

<213> bono sapiens 

<400> 10 , 

Met Gly Cys Trp Gly Arg Asn Arg Gly Arg Leu Leu Cys Met Leu Ala 

1 5 10 

Leu Thr Phe Met Phe Met Val Leu Glu Val Val Val Ser Arg Val Thr 

20 25 30 

Ser Ser Leu Ala Met Leu Ser Asp Ser Phe His Met Leu Ser Asp Val 

35 40 45 

Leu Ala Leu Val Val Ala Leu Val Ala Glu Arg Phe Ala Arg Arg Thr 

50 55 60 

His Ala Thr Gin Lys Asn Thr Phe Gly Trp lie Arg Ala Glu Val Met 
65 70 75 80 

Gly Ala Leu Val Asn Ala lie Phe Leu Thr Gly Leu Cys Phe Ala lie 

85 90 ^5 

Leu Leu Glu Ala He Glu Arg Phe He Glu Pro His Glu Met Gin Gin 

100 105 110 

Pro Leu Val Val Leu Gly Val Gly Val Ala Gly Leu Leu Val Asn Val 

115 120 125 

Leu Gly Leu Cys Leu Phe His His His Ser Gly Phe Ser Gin Asp Ser 

130 135 140 

Glv His Gly His Ser His Gly Gly His Gly His Gly His Gly Leu Pro 
145 150 155 160 

Lys Gly Pro Arg Val Lys Ser Thr Arg Pro Gly Ser Ser Asp He Asn 

165 170 175 

Val Ala Pro Gly Glu Gin Gly Pro Asp Gin Glu Glu Thr Asn Thr Leu 

180 185 190 

Val Ala Asn Thr Ser Asn Ser Asn Gly Leu Lys Leu Asp Pro Ala Asp 

195 200 205 

Pro Glu Asn Pro Arg Ser Gly Asp Thr Val Glu Val Gin Val Asn Gly 

210 215 220 

Asn Leu Val Arg Glu Pro Asp His Met Glu Leu Glu Glu Asp Arg Ala 
225 230 235 240 

Gly Gin Leu Asn Met Arg Gly Val Phe Leu His Val Leu Gly Asp Ala 

245 250 255 

Leu Gly Ser Val He Val Val Val Asn Ala Leu Val Phe Tyr Phe Ser 

260 265 270 

Trp Lys Gly Cys Ser Glu Gly Asp Phe Cys Val Asn Pro Cys Phe Pro 

275 280 285 

Asp Pro Cys Lys Ala Phe Val Glu He He Asn Ser Thr His Ala Ser 
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290 295 300 

Leu Tyr Glu Ala Gly Pro Cys Trp Val Leu Tyr Leu Asp Pro Thr Leu 
305 310 315 320 

CYS val Val Met Val Cys He Leu Leu Tyr Thr Thr Tyr Pro Leu Leu 

325 330 335 

Lys Glu Ser Ala Leu He Leu Leu Gin Thr Val Pro Lys Gin He Asp 

340 345 350 

He Arg Asn Leu He Lys Glu Leu Arg Asn Val Glu Gly Val Glu Glu 

355 360 365 

Val His Glu Leu His Val Trp Gin Leu Ala Gly Ser Arg He He Ala 

310 375 380 

Thr Ala His He Lys Cys Glu Asp Pro Thr Ser Tyr Met Glu Val Ala 
385 390 395 400 

LVS Thr He Lys Asp Val Phe His Asn His Gly He His Ala Thr Thr 

405 410 415 

He Gin Pro Glu Phe Ala Ser Val Gly Ser Lys Ser Ser Val Val Pro 

420 425 430 

Cys Glu Leu Ala Cys Arg Thr Gin Cys Ala Leu Lys Gin Cys Cys Gly 

435 440 445 

Thr Leu Pro Gin Ala Pro Tyr Gly Lys Asp Ala Glu Lys Thr Pro Ala 

450 455 460 

Val Ser He Ser Cys Leu Glu Leu Ser Asn Asn Leu Glu Lys Lys Pro 
465 470 475- 480 

Ara Arg Thr Lys Ala Glu Asn He Pro Ala Val Val He Glu He Lys 

485 490 495 

Asn Met Pro Asn Lys Gin Pro Glu Ser Ser Leu 
500 505 

<210> 11 
<211> 2222 
<212> DNA 

<213> homo sapiens 
<400> 11 

ctccggctgc ggctcttggt accccggctc cgggagccca gctccccgcc accgccgccg 
cctgggtgtg ggggctgctg aggctgagcc gggcttcggc gccggctctg aggacggacg 
cctgaggagc tgcgcggcgc ggcgccgccg gctggcggag aacgcccaca ggcgcggggc 
tcggcggctt gacccgggct tgtccccgtg cggccgcggg ggcccctcag cggtttcccg 
aacggcccga ctcgggcgct cctccgtgtc gcggtcgccg accctccgcg tcccgccaac 
gccgccgctg caccagtctc cgggccgggc tcggcgggcc ccgcagccgc agccatgggg 
tgttggggtc ggaaccgggg ccggctgctg tgcatgctgg cgctgacctt catgttcatg 
gtgctggagg tggtggtgag ccgggtgacc tcgtcgctgg cgatgctctc cgactccttc 
cacatgctgt cggacgtgct ggcgctggtg gtggcgctgg tggccgagcg cttcgcccgg 
cggacccacg ccacccagaa gaacacgttc ggctggatcc gagccgaggt aatgggggct 
ctggtgaacg ccatcttcct gactggcctc tgtttcgcca tcctgctgga ggccatcgag 
cqcttcatcg agccgcacga gatgcagcag ccgctggtgg tccttggggt cggcgtggcc 
gggctgctgg tcaacgtgct ggggctctgc ctcttccacc atcacagcgg cttcagccag 
gactccggcc acggccactc gcacgggggt cacggccacg gccacggcct ccccaagggg 
cctcgcgtta agagcacccg ccccgggagc agcgacatca acgtggcccc gggcgagcag 
ggtcccgacc aggaggagac caacaccctg gtggccaata ccagcaactc caacgggctg 
aaattggacc ccgcagaccc agaaaacccc agaagtggtg atacagtgga agtacaagtg 
aatggaaatc ttgtcagaga acctgaccat atggaactgg aagaagatag ggctggacaa 
cttaacatgc gtggagtttt tctgcatgtc cttggagatg ccttgggttc agtgattgta 
qtagtaaatg ccttagtctt ttacttttct tggaaaggtt gttctgaagg ggatttttgt 
gtgaatccat gtttccctga cccctgcaaa gcatttgtag aaataattaa tagtactcat 
gcatcacttt atgaggctgg tccttgctgg gtgctatatt tagatccaac tctttgtgtt 
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gtaatggttt gtatacttct ttacacaacc tatccattac ttaaggaatc tgctcttatt 
cttctacaaa ctgttcctaa acaaattgat atcagaaatt tgataaaaga acttcgaaat 
gttgaaggag ttgaggaagt tcatgaatta catgtttggc aacttgctgg aagcagaatc 
attgccactg ctcacataaa atgtgaagat ccaacatcat acatggaggt ggctaaaacc 
attaaagacg tttttcataa tcacggaatt cacgctacta ccattcagcc tgaatttgct 
agtgtaggct ctaaatcaag tgtagttccg tgtgaacttg cctgcagaac ccagtgtgct 1680 
ttgaagcaat gttgtgggac actaccacaa gccccttatg gaaaggatgc agaaaagacc 17 4 u 
ccagcagtta gcatttcttg tttagaactt agtaacaatc tagagaagaa gcccaggagg 
actaaagctg aaaacatccc tgctgttgtg atagagatta aaaacatgcc aaacaaacaa 
cctgaatcat ctttgtgagt cttgaaaaag atgtgatatt tgacttttgc tttaaactgc 
aagaggaaaa agactccact gaaattctaa gtttgccaag tagtgtaatt gaagtccttg 
tctggtcaca cagtttaatt ctatttttgt aagaacataa tgggactgca taacagagtt 
ctatattaca atttgtgatt attagtacag agtacagcta tgctgtgact gttttggaaa 
gccagtttta acactatgtt acatttttgt ttaaagtaag ttaaacctta tataacataa 
tgacatttga tttctggatt tttcccatgg ataaaaaatt aggggggata aaattaaaat 
tg 
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